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Foreword 


This  is  the  first  of  a  series  of  four  volumes  which  are  to  contain  the 
Proceedings  of  the  Summer  Seminar  on  Applied  Mathematics, 
arranged  by  the  American  Mathematical  Society  and  held  at  the 
University  of  Colorado  for  the  period  July  24  through  August  19, 
I960.  The  Seminar  was  under  the  sponsorship  of  the  National 
Science  Foundation,  Office  of  Naval  Research,  Atomic  Energy 
Commission,  and  the  Office  of  Ordnance  Research. 

For  many  years  there  was  an  increasing  barrier  between  mathematics 
and  modern  physics.  The  separation  of  these  two  fields  was  regret¬ 
table  from  the  point  of  view  of  each— physical  theories  were  largely 
isolated  from  the  newer  advances  in  mathematics,  and  mathematics 
itself  lacked  contact  with  one  of  the  most  stimulating  intellectual 
developments  of  our  times.  During  recent  years,  however,  mathe¬ 
maticians  and  physicists  have  displayed  alacrity  fpr  mutual  exchange. 
This  Seminar  was  designed  to  enlarge  the  much-needed  contact  which 
has  begun  to  develop. 

The  purpose  of  the  Seminar  was  primarily  instructional,  with 
emphasis  on  basic  courses  in  classical  quantum  theory,  quantum 
theory  of  fields  and  elementary  particles,  and  statistical  physics, 
supplemented  by  lectures  specially  planned  to  complement  them. 
The  publication  of  these  volumes  is  intended  to  extend  the  same 
information  presented  at  the  Seminar  to  a  much  wider  public  than 
was  privileged  to  actually  attend,  while  at  the  same  time  serving  as  a 
permanent  reference  for  those  who  did  attend. 

Following  are  members  of  a  committee  who  organized  the  program 
of  the  Seminar: 

Kurt  O.  Friedrichs,  Chairman 
Mark  Kac 

Menahem  M.  Schiffer 
George  E.  Uhlenbeck 
Eugene  P.  Wigner 

Local  arrangements,  including  the  social  and  recreational  program. 
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FOREWORD 


were  organized  by  a  committee  from  the  University  of  Colorado,  as 
follows: 

Charles  A.  Hutchinson 
Robert  W.  Eliingwood 

The  enduring  vitality  and  enthusiasm  of  the  chairmen,  and  the 
cooperation  of  other  members  of  the  university  staff,  made  the  stay 
of  the  participants  extremely  pleasant;  and  the  four  agencies  which 
supplied  financial  support,  as  acknowledged  on  the  copyright  page, 
together  with  the  Admissions  Committee,  consisting  of  Bernard 
Friedman,  Wilfred  Kaplan,  and  Kurt  O.  Friedrichs,  Chairman,  also 
contributed  immeasurably  to  the  successful  execution  of  the  plans  for 
the  Seminar. 

The  Seminar  opened  with  an  address  given  by  Professor  Mark  Kac, 
Department  of  Mathematics,  Cornell  University,  on  the  subject  “A 
Mathematician's  Look  at  Physics:  What  Sets  us  Apart  and  What 
May  Bring  us  Together."  Afternoons  were  purposely  kept  free  to 
give  participants  a  chance  to  engage  in  informal  seminars  and 
discussions  among  themselves  and  with  the  distinguished  speakers  on 
the  program. 

Editorial  Committee 

V.  Bargmann 
G.  Uhlenbeck 
M.  Kac,  Chairman 
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Preface 


These  lectures  were  given  by  one  of  us  (G.  E.  U.)  as  a  part  of  the 
summer  symposium  in  theoretical  physics,  which  was  organized  by  the 
American  Mathematical  Society  at  the  University  of  Colorado  in  the 
summer  of  1960.  The  lectures  are  printed  here  almost  in  the  same 
form  as  they  were  presented,  which  explains  the  rather  colloquial 
style  and  the  perhaps  excessive  use  of  the  first  personal  pronoun. 
This  last  feature  does  not  mean  that  the  content  of  the  lectures  is  due 
only  to  the  first  author.  All  lectures  were  thoroughly  discussed  and 
prepared  by  both  of  us,  and  most  of  the  developments  for  which  some 
originality  may  be  claimed  are  the  result  of  the  collaboration  of  the 
two  authors  during  many  years. 

The  purpose  of  the  summer  symposium  was  to  acquaint  a  group  of 
mathematicians  with  some  of  the  basic  problems  in  present-day 
theoretical  physics,  hoping  that  this  would  stimulate  a  more  intense 
collaboration.  In  our  opinion  such  a  collaboration  would  be  especially 
valuable  in  statistical  mechanics,  since  here  many  of  the  unsolved 
problems  can  be  formulated  precisely  and  are  of  a  technical  mathe¬ 
matical  nature.  Furthermore,  because  of  the  lack  of  exact  knowledge 
of  the  intermolecular  forces,  one  is  usually  more  interested  in  the 
explanation  of  the  qualitative  features  of  the  macroscopic  phenomena 
than  in  the  precise  quantitative  prediction  of  the  macroscopic  quantities. 
This  qualitative  aspect  of  the  theory  should  appeal  to  the  mathe¬ 
matician  while  for  the  physicist  it  is  often  a  real  difficulty.  Since  the 
facts  are  so  well  known  the  latter  is  often  tempted  to  be  satisfied  with 
more  or  less  uncontrolled  approximations  based  on  intuitive  argu- 
•  ments.  These  are  often  very  valuable,  but  usually  they  do  not  provide, 

so  to  say,  a  foothold  for  a  rigorous  mathematical  treatment.  In  our 
opinion  such  a  treatment  is  especially  needed  in  statistical  mechanics, 
and  to  provide  it  is  the  real  challenge  of  the  subject. 

We  have  therefore  stressed  as  much  as  possible  the  logical  structure 
of  the  theory,  and  we  have  always  tried  to  indicate  the  mathematical 
gaps  which  remain  in  the  argument.  In  addition  we  have  tried  to 
start  from  the  beginning  and  to  avoid  as  much  as  possible  the  phrase: 
it  can  be  shown.  Proofs  are  often  put  in  the  notes  at  the  end  of  each 
chapter  together  with  references  to  the  literature.  We  hope  that  as 
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a  result  the  book  can  be  used  as  a  short  but  self-contained  introduction 
to  the  subject,  although  we  realize  that  it  will  have  to  be  used  with 
tact,  since  the  textbook-like  chapters  alternate  with  chapters  describing 
work  still  in  progress. 

To  achieve,  even  only  approximately,  these  rather  conflicting  goals, 
it  soon  became  clear  to  us  that  we  had  to  limit  the  field  considerably. 
Not  only  have  we  restricted  ourselves  to  the  simplest  possible  physical 
systems  (namely  mono-atomic  gases)  but  even  for  these  systems  we 
could  only  discuss  a  few  characteristic  problems.  The  most  severe 
limitation  we  had  to  impose  was  the  restriction  to  the  classical  theory. 
This  is  fortunately  somewhat  mitigated  by  the  two  lectures  Dr.  E. 
Montroll  gave,  which  are  reproduced  in  the  Appendix.  Here  an 
account  is  given  of  some  of  the  recent  developments  in  quantum 
statistics  to  which  Dr.  Montroll  himself  has  made  such  important 
contributions. 

In  one  respect  we  did  not  want  to  limit  ourselves.  It  is  often 
customary  either  to  consider  only  the  statistical  problems  for  the 
equilibrium  state  (sometimes  called  statistical  thermodynamics),  or  to 
concentrate  on  the  explanation  of  the  irreversible  and  transport 
phenomena  (sometin.es  called  kinetic  theory).  It  seems  to  us  that 
such  a  limitation  gives  a  distorted  view  of  the  subject,  and  that  one 
should  try  to  give  a  unified  treatment  both  of  equilibrium  and  non¬ 
equilibrium  statistical  mechanics.  We  have  therefore  devoted  six 
lectures  (reproduced  in  the  first  three  chapters)  to  equilibrium  prob¬ 
lems  and  six  lectures  (reproduced  in  Chapters  4-7)  to  non-equilibrium 
problems,  and  we  have  attempted  to  point  out  at  least  the  conceptional 
connections  between  the  two  fields. 

Finally  there  remains  the  pleasant  duty  to  thank  in  the  first  place 
Professor  Mark  Kac  for  all  the  help  and  advice  he  has  given  us.  If 
these  lectures  would  inspire  one  other  mathematician  of  the  calibre  of 
Professor  Kac  to  work  on  the  problems  of  statistical  mechanics  the 
symposium  would  have  more  than  justified  itself!  The  reader  would 
also  be  well  advised  to  read  in  parallel  with  these  lectures  the  third 
chapter  in  the  book  Probability  and  Related  Topics  in  Physical  Sciences, 
by  Professor  Kac,  since  the  outlook  is  in  many  respects  the  same  as 
ours.  Then  we  want  to  thank  Professor  T.  H.  Berlin  for  many 
discussions  and  for  his  permission  to  use  a  few  parts  of  a  manuscript 
on  which  he  and  one  of  us  (O.  E.  U.)  have  been  working  for  quite 
some  time. 

G.  E.  Uhlenbeck 

G. W.  Ford 


Chapter  I 


The  Explanation  of  the  Laws 
of  Thermodynamics 


1.  Introduction.  In  this  chapter  I  will  try  to  present  an  outline  of  the 
foundations  of  statistical  mechanics,  and  I  must  apologize  for  the  fact 
that  it  will  not  be  a  clear,  axiomatic  presentation.  This  is  not  because 
I  am,  as  some  physicists  are,  impatient  with  general  discussions  of 
fundamentals.  On  the  contrary,  I  believe  that  further  progress  is 
intimately  connected  with  a  further  clarification  of  the  foundations. 
However,  at  present  there  is  no  generally  accepted  opinion  of  what  the 
basic  assumptions  of  the  theory  are,  and  as  a  result  I  can  only  present 
my  own  point  of  view,  which  is,  I  believe,  a  kind  of  paraphrase  of  the 
fundamental  ideas  of  Boltzmann  and  Gibbs.  And  since  these  ideas 
were  mainly  developed  in  the  attempt  of  explaining  the  laws  of  thermo¬ 
dynamics  from  the  molecular  theory  of  matter,  I  will  follow  also  in 
this  respect  in  the  footsteps  of  the  two  founders  of  statistical 
mechanics.1* 

First,  let  me  remind  you  of  the  general  problem  of  statistical  physics. 
Given  the  structure  and  the  laws  of  interaction  of  the  molecules, 
what  are  the  macroscopic  properties  of  the  matter  composed  of  these 
molecules?  To  start  one  has  therefore  to  say  something  about  the 
molecular  model  and  about  the  basic  microscopic  laws.  For  sim¬ 
plicity  I  will  almost  always  assume: 

(a)  The  motion  of  the  molecules  is  governed  by  classical  mechanics. 
It  is  true  that  the  quantum  mechanics  adds  new  features  to  the  prob¬ 
lem;  it  may  even  be  that  the  act  of  measurement  introduces  a  “true” 
irreversibility  into  the  theory  as  some  authors  claim.  But  most  of  the 
essential  questions  arise  already  in  the  classical  theory,  and  since  it  is 
more  familiar  to  you,  I  will  restrict  myself  to  this  theory. 

(b)  The  molecules  are  mass  points  interacting  through  central 
forces,  which  have  the  additivity  property  and  which  are  of  the  van 


•  Numbers  refer  to  Notes  at  end  of  Chapter. 

I 


2  THE  EXPLANATION  OF  THE  LAWS  OF  THERMODYNAMICS 


der  Waals  or  molecular  type.  This  means  that  the  given  Hamiltonian 
of  the  system  of  N  particles  is  of  the  form: 

(,)  H=  i  K  +  ^>1  +  2#lr‘-  r/l) 

i- 1  i"n  j  i<i 

where  the  interaction  potential  #»  has  the  typical  form  shown  in 
Fig.  1.  It  consists  of  an  attractive  region  with  finite  range  rx  and  a 


very  steep  repulsive  part  which  can  be  taken  as  a  hard  core  of  radius 
r0.2  In  Eq.  (1)  U(rt)  is  the  potential  of  outside  forces  acting  on  the 
ith  particle  and  includes  the  potential  corresponding  to  the  forces 
exerted  by  the  walls  of  the  container  so  as  to  keep  the  particles  inside 
the  volume  V.  In  fact  this  “wall  potential"  will  usually  be  assumed 
to  be  the  only  outside  force  acting  on  the  molecules. 

The  limited  validity  of  these  assumptions  should  be  emphasized. 
The  basic  microscopic  laws  are  surely  quantum-mechanical,  and  the 
great  richness  of  the  macroscopic  properties  of  matter  can  only  be 
understood  if  more  complicated  molecular  structures  and  other  types 
of  forces  are  taken  into  account.  Our  assumptions  would  hold  only, 
say,  for  gaseous  or  liquid  Neon.  However,  even  with  these  simplifying 
assumptions  there  are  still  plenty  of  unsolved  problems!  Also  many 
of  the  questions,  which  we  will  consider,  depend  only  on  the  qualita* 
tive  features  of  the  interactions  and  their  treatment  will  therefore  have 
a  much  wider  validity. 
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1  The  Lkwrilic  theorem.  We  represent  the  state  of  a  mechanical 
system  of  n  degrees  of  freedom  by  a  point  in  the  2/i-dimensional  phase 
space  or  F-space.  In  our  case  n  -  3 N,  N  -  number  of  particles, 
and  the  coordinates  of  a  point  in  F-space  are  rt-  •  -rN,  pr  •  pN.  In 
the  course  of  time  this  F-point  will  move  according  to  the  Hamilton 
equations  of  motion : 


(2) 


i  -  1,2,  •  •  •,«. 


Since  these  equations  are  of  first  order  in  the  time  it  is  clear  that  the 
path  of  the  representative  point  in  F-space  will  be  determined  by  the 
initial  point.  To  visualize  the  motion  for  all  possible  initial  points, 
imagine  that  one  has  a  very  large  number  of  copies  of  the  system. 
Each  copy  sends  a  point  in  F-space,  and  if  the  number  is  large  enough, 
one  may  consider  the  collection  of  points  as  a  fluid  streaming  in 
F-space  and  possessing  in  any  point  a  certain  number  density 
piqx-  •  •  -Pn,  f).  The  streamlines  of  the  fluid  motion  are 

identical  with  the  particle  paths  as  determined  by  (2).  With  Gibbs 
we  call  this  the  ensemble  fluid  or  just  an  ensemble  of  mechanical  systems. 
One  should  note  that  the  total  number  of  copies  never  plays  a  role 
and  is  only  introduced  as  an  aid  to  visualize  all  the  possible  motions 
of  the  F-point.  The  ensemble  fluid  is  a  true  continuum,  and  instead 
of  the  streaming  of  the  fluid  it  may  be  better  to  speaV  of  a  continuous 
mapping  of  the  F-space  in  itself.  Soon  also  the  density  p  will  become 
the  probability  density  that  the  system  is  in  a  region  dqx-dpH  of 
F-space,  and  we  shall  then  denote  it  by  £>(</,  •  •  pn,  t)  and  assume  the 
normalization  condition: 


d<h  ■■■  dpn  D  =  1. 


In  addition  if  the  N  particles  are  identical  one  should  require  that  D 
be  a  symmetric  function  of  the  phases  (r,,  p,)  of  the  separate  particles. 

A  theorem  which  plays  a  central  role  is  the  theorem  of  Liouville,3 
which  states  that  the  ensemble  fluid  moves  as  if  it  were  an  incompressible 
fluid. 

To  prove  this,  note  that  the  “velocity”  V  (components  qx-  •  qn. 
Pi  -  ■  • Pn )  fulfills  the  condition: 
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by  virtue  of  the  Hamilton  equations.  Since  one  always  has  the 
continuity  equation: 

|f  +  div(pV)  =  0 
or: 

(4)  ^f  +  P  divV  =  0 


with 

(5) 


*+V 
Dt  bt 


grad 


.  d  . 


it  follows  from  (3)  that: 
(6) 


Dp 

Dt 


0 


which  means  that  p  does  not  change  if  one  moves  with  the  fluid,  or 
the  fluid  is  incompressible. 

If  one  stays  at  a  fixed  point  in  T-space  then  p  changes  according  to: 


or: 

(7) 


U  +  V  grad  p  =  0 


bp  _  £•  /  dH  bp 
ft  ~  ,-i  W  dPi 


bH  bp 
bp,  bq, 


where  {//,  p}  is  the  Poisson  bracket  of  H  and  p.  This  is  the  “  Eulerian  " 
way  of  expressing  the  incompressibility  of  the  fluid.  In  the  “  Lagran- 
gian  ”  way  of  describing  the  motion  of  the  fluid,  the  position  (qx  •••/>„) 
of  a  fluid  element  is  given  as  a  function  of  the  initial  position  (q,0-  ■  pn0) 
and  of  the  time.  The  condition  for  the  incompressibility  of  the  flow 
is  now  that  the  Jacobian: 


(8) 


■  ■  •  Pn)  m  , 
<i<ho  ■;  ■  Pno) 


which  expresses  the  fact  that  if  one  follows  a  whole  volume  of  points 
during  the  flow,  this  volume  always  stays  the  same,  although  of  course 
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its  shape  will  vary.  Or  in  other  words:  the  transformation  of  the 
/'•space  into  itself  induced  by  the  flow  preserves  volumes.  It  is  easy 
to  see  that  the  characterizations  (7)  and  (8)  are  completely  equivalent. 

Two  simple  consequences  of  the  Liouville  theorem  are  worth 
noting:  * 

(a)  For  any  function  F(p)  of  the  density,  the  integral: 

J  •  -  •  jV(p)  dqx-  •  dpn 

taken  over  the  whole  Aspace  is  independent  of  the  time. 

(b)  Any  density  distribution  p  will  be  stationary,  i.e.  will  not  depend 
explicitly  upon  the  time,  if  and  only  if  p  is  constant  along  each  stream¬ 
line.  In  particular  any  distribution  p(H),  which  is  a  function  of  the 
Hamiltonian  H,  will  be  stationary,  although  such  distributions  are  by 
no  means  the  only  stationary  ones. 

3.  The  approach  to  equilibrium;  the  ideas  of  Boltzmann.  How  can  one 

“explain”  the  irreversible  behaviour  of  macroscopic  systems  from  the 
strictly  reversible  mechanical  model?  This  question,  which  I  call  the 
problem  of  Boltzmann,  has  dominated  the  whole  initial  development 
of  statistical  mechanics  and  it  is  still  being  discussed.  In  its  simplest 
form,  one  must  “explain”  in  which  sense  an  isolated  (that  is  a  con¬ 
servative)  mechanical  system  consisting  of  a  very  large  number  of 
molecules  approaches  thermal  equilibrium,  in  which  all  “macro¬ 
scopic"  variables  have  reached  steady  values.  This  is  sometimes 
called  the  zeroth  law  of  thermodynamics  and  it  expresses  the  most 
typical  irreversible  behaviour  of  macroscopic  systems  familiar  from 
common  observation.  * 

The  conflict  with  mechanics  is  most  drastically  shown  if  one  recalls 
the  famous  Poincare  recurrence  theorem.  For  a  conservative  system, 
the  motion  of  the  F -point  will  stay  on  the  energy  surface  H{q^  •  •  •/»„) 
=  E.  Furthermore  the  motion  is  hounded  (in  the  momenta  by  the 
total  energy  E  and  the  finite  attractive  potential  energy;  in  the  co¬ 
ordinates  by  the  volume  in  which  the  molecules  are  enclosed).  For 
such  a  mechanical  system  starting  from  a  point  P  on  the  energy  surface, 
the  theorem  says  that  for  any  region  around  P,  there  is  a  time  T  in 
which  the  phase  point  of  the  system  will  return  to  the  region  (see 
Fig.  2).  Or  in  other  words,  the  motion  is  quasi  periodic,  and  there  is 
apparently  no  trace  of  an  approach  to  equilibrium.  An  outline  of  the 
proof  of  the  Poincare  theorem  is  given  in  the  notes.4 

Let  me  now  try  to  sketch  the  classical  Boltzmann-Gibbs  resolution 
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of  this  conflict.  The  general  idea  is  that  the  two  sides  of  the  conflict 
refer  to  two  different  levels  of  observation  or  description  of  the 
system,  namely  the  molecular  and  the  macroscopic  level.  Clearly 
thermal  equilibrium  is  a  macroscopic  notion.  For  its  description  it 
is  not  necessary  to  know  the  precise  path  of  the  T-point  over  the 
energy  surface.  It  can  be  characterized  by  a  small  number  (small 
compared  to  N )  of  “macroscopic”  quantities.  We  shall  see  that  it  is 
due  to  this  difference  in  description  that  one  can  understand  and 


reconcile  the  apparently  basic  difference  in  the  temporal  behaviour  on 
the  two  levels. 

To  see  how  this  works,  consider  first  with  Boltzmann  the  case  of  an 
ideal  gas.  When  the  intermolecular  forces  can  be  neglected,  then  all 
macroscopic  properties  of  the  system  are  determined  by  the  distribu¬ 
tion  of  the  jY  points  representing  the  coordinates  and  momenta  of  the 
N  molecules  in  the  six  dimensional  phase  space  of  a  single  molecule. 
We  call  this  the  fi-space  distribution.  To  describe  such  a  distribution 
of  a  finite  number  of  points,  one  must  divide  the  total  /x-space  (which 
is  finite  since  it  is  limited  by  the  volume  of  the  vessel  and  the  given 
total  energy)  in  a  large  number  M  of  small  but  finite  regions  or  cells 
w,,  a*a-  •  u>M,  and  one  must  record  at  a  given  time  the  numbers 
nu  «a  ■  -  of  points  in  these  cells.  The  size  of  the  cells  must  on  the 
one  hand  be  large  enough  so  that  the  numbers  n,  are  appreciable  and 
on  the  other  hand  small  enough  so  that  the  n,  really  give  a  presentation 
of  the  density  distribution  of  the  N  points  in  ft-space.  One  some¬ 
times  expresses  this  compromise  by  saying  that  the  cells  must  be 


THE  APPROACH  TO  EQUILIBRIUM;  THE  IDEAS  OF  BOLTZMANN  7 

“physically  infinitesimal”.  It  is  a  compromise  which  always  must  be 
made  in  the  description  of  an  empirical  distribution  of  a  finite  number 
of  discrete  entities. 

Clearly  the  distribution  n3  -  •  •  over  the  cells  tuu  •  •  in  /x-space 
describes  the  state  of  the  gas  much  less  precisely  than  the  description 
by  one  point  in  A  space.  Each  point  in  Aspace  corresponds  to  a 
definite  distribution  in  /x-space,  but  the  reverse  is  not  true.  In  fact 
it  is  easy  to  prove  that  a  given  distribution  in  /x-space  corresponds  to  a 
whole  region  (6yV-dimensional)  in  Aspace  which  has  the  volume: 


(9)  w  =  , 

nx\  »2!  ■  •  •  ««! 

Proof.  Clearly  the  distribution  nt  will  not  change  if  we  move  the 
molecules  around  in  their  cells.  Moving,  say,  one  of  the  molecules 
in  the  first  cell  around,  leaving  all  other  molecules  fixed,  will  move  the 
corresponding  Apoint  over  a  six-dimensional  volume  a,,.  Since  in 
Aspace  the  states  of  the  different  molecules  are  represented  by 
orthogonal  six-dimensional  sub-spaces,  the  motion  of  all  N  molecules 
in  their  cells  will  move  the  Apoint  over  a  “block”  in  Aspace  of  the 
volume: 

(10)  0,1*  U>22  •  •  ■  . 

In  addition  the  distribution  n,  will  not  change  if  we  permute  two 
molecules  in  different  cells  and  the  motion  of  the  molecules  in  their 
cells  after  the  permutation  will  again  produce  a  block  of  size  (10)  in 
Aspace,  which  will  have  no  common  region  with  the  previous  block. 
Since  there  are: 


(ID 


m 

n,!  na!  ■  nM\ 


ways  of  dividing  the  N  molecules  in  groups  nu  «a-  •  •,  the  total  volume 
in  Aspace  will  be  given  by  the  product  of  (10)  and  (11). 

The  set  of  numbers  n,  is  restricted  by  two  subsidiary  conditions: 


n,  =  jV, 

< 

2  e«w*  =  E 


(12) 
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which  express  the  conservation  of  total  number  and  of  total  energy. 
Here  £,  denotes  the  energy  of  a  particle  in  the  cell  to,.  That  one  can 
assign  this  energy  independently  of  the  occupation  numbers  of  the 
cells  is  clearly  due  to  the  assumption  of  the  ideality  of  the  gas.  Note 
also  that  in  ( 1 2)  the  value  of  E  is  not  precisely  determined,  because  the 
c,  will  vary  slightly  when  the  points  move  in  their  cells.  The  E  in 
(12)  is  therefore  not  strictly  the  same  as  the  given  energy  which  deter¬ 
mines  the  energy  surface  H(q,  p)  =  E  in  Aspace  on  which  the  Apoint 
moves.  In  fact  the  energy  condition  (12)  determines  two  neigh¬ 
bouring  energy  surfaces  around  the  given  energy  surface.  The 
region  in  between  we  will  call  the  energy  shell ;  its  thickness  clearly 
depends  again  on  the  finiteness  of  the  cells  <»,  in  /x-space. 


The  volumes  (♦'(/ii,  «*■••)  of  the  different  distributions  /»,  over  the 
cells  u>,  in  jx-space  (as  given  by  (9))  will  cut  out  of  the  energy  shell 
cylindrical  regions  of  different  size.  Now  one  can  prove  the 
theorem:5 

If  N  is  very  large  then  the  so-called  Maxwell-Boltzmann  (M.B.) 
distribution: 

(13)  h,  =  Auiie'Bt< 

corresponds  to  a  volume  W  which  cuts  out  the  overwhelmingly  largest 
portion  of  the  energy  shell.  In  (13)  the  A  and  /3  are  constants  which 
must  be  determined  from  the  subsidiary  conditions  (12).  Schemati¬ 
cally  the  division  of  the  energy  surface  in  A  space  by  the  various 
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volumes  W(t\ a,  na-  •  •)  is  shown  in  Fig.  3.  Boltzmann  now  identifies 
the  M.B.  distribution  in  /x-space  with  the  thermal  equilibrium  state  of 
the  gas,  and  all  other  /x- space  distributions  with  the  macroscopic 
non-equilibrium  states  of  the  gas. 

If  we  now  assume,  that  in  the  motion  of  the  Apoint  over  the  energy 
surface  there  is  no  preference  for  any  portion  of  the  surface  and  that 
in  the  course  of  time  every  accessible  part  of  the  surface  will  be 
reached,  then  it  is  very  plausible  that  the  time  t(A)  during  which  the 
Apoint  is  inside  a  region  A  is  mainly  determined  by  the  magnitude  of 
the  area  of  A.  Clearly  one  can  then  conclude: 

1.  If  the  gas  is  not  in  the  thermal  equilibrium  state  (  =  M.B. 
distribution),  then  it  almost  always  will  go  into  this  state. 

2.  Once  the  gas  is  in  the  equilibrium  state  it  will  almost  always 
stay  there,  although  fluctuations  away  from  equilibrium  will  and 
must  occur  because  of  the  quasi-periodic  character  of  the  motion  of 
the  Apoint. 

This  is  the  Boltzmann  picture.  It  clearly  reconciles  the  reversibility 
of  the  mechanical  motion  as  expressed  by  the  Poincare  theorem 
with  the  existence  of  a  state  of  macroscopic  equilibrium. 

4.  The  generalization  by  Gibbs.  The  ergodic  theorems.  In  the  previous 
section  we  have  avoided  on  purpose  any  mention  of  the  word  proba¬ 
bility  and  of  the  ergodic  hypothesis  so  as  to  emphasize  the  plausibility 
of  the  general  ideas  of  Boltzmann.  In  order  to  generalize  these  ideas 
and  to  show  the  connection  with  the  theory  of  probability  and  with 
the  ergodic  theorems,  let  us  repeat  the  previous  argument  in  a  more 
abstract  fashion. 

Instead  of  describing  the  macroscopic  state  of  the  gas  by  the  numbers 
«i,  of  points  in  the  different  cells  of  jx-space,  one  could  also  say 
that  the  macroscopic  state  of  the  gas  is  defined  by  the  values  of  a 
number  of  macroscopic  variables: 

(14)  }’,  =  x2  ■  ■  ■  xN) 

which  are  functions  of  the  phases  xk  &  (rk,  p„)  of  the  N  particles.  For 
identical  particles  the  functions  /  should  of  course  be  symmetric  in 
the  xk.  In  fact  in  our  case  (ideal  gas),  one  can  take  for  these  functions 

y,  =  £  At(xk),  1=1,2  M 

k  ‘ 

J'w{  °  J 


(15) 

where 


if  x  is  in  cell  <o,  of  /x-space, 
otherwise. 
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The  values  of  the  y,  are  then  clearly  the  numbers  nt  in  the  cells  wh 
corresponding  to  the  point  P  s(xx--  xN)  in  Aspace.  Here  again 
the  question  of  the  size  of  the  cells  wt  comes  up.  Instead  of  saying 
that  they  should  be  physically  infinitesimal,  as  explained  in  §3,  so  as 
to  give  the  "best"  description  of  the  distribution  in  /i-space,  one  can 
also  say  that  the  sizes  of  the  cells  are  essentially  arbitrary  and  depend 
only  on  how  detailed  the  macroscopic  observation  is.  However,  they 
always  must  be  finite,  so  that  corresponding  to  a  given  set  of  values  of 
the  yt  there  is  a  region  in  r -space.  In  addition  it  must  always  be  so 
that  for  one  set  of  values  of  the  macroscopic  variables  yu  when  N  is 
very  large,  the  corresponding  region  in  Aspace  is  overwhelmingly 
the  largest.  Of  course  in  our  case  this  follows  as  we  saw  from  the 
explicit  expression  (9)  for  the  volume  of  the  Aspace  region. 

In  the  general  case,  when  the  intermolecular  forces  are  not  neg¬ 
lected,  it  is  clear  that  the  p- space  distribution  will  not  be  enough  to 
specify  the  macroscopic  state  of  the  system.  For  instance,  often  we 
will  have  to  know  how  many  pairs  of  molecules  are  in  each  other’s 
action  sphere  since  this  will  affect  some  of  the  macroscopic  variables. 
However,  we  will  still  assume  that  the  macroscopic  state  can  be  speci¬ 
fied  by  giving  the  values  of  a  number  of  phase  functions  v,,  so  that 

(a)  each  macroscopic  state  corresponds  to  a  region  in  Aspace  and 
therefore  to  a  portion  of  the  energy  shell,  and  so  that: 

(b)  for  large  N  there  is  one  set  of  values  of  the  y,  which  corresponds 
to  a  region  which  is  overwhelmingly  the  largest.6 

For  a  given  macroscopic  description,  that  is  for  a  given  choice  of 
the  variables  .»•„  the  energy  shell  will  then  be  divided  in  fixed  regions 
as  indicated  schematically  in  Fig.  3.  Suppose  now  that  at  t  =  0,  one 
has  "prepared"  the  system  in  a  definite  non-equilibrium  macroscopic 
state.  With  this  initial  macroscopic  state  we  associate  an  initial  Aspace 
probability  distribution  D(P,  t  =  0)  which  is  different  from  zero  only 
in  the  corresponding  region  of  the  energy  shell.  Inside  this  region 
D{P,  0)  is  in  principle  arbitrary,  except  that  it  should  be  a  sufficiently 
smooth  function  of  P.  One  can  take  for  instance  D(P,  0)  constant 
inside  the  region  and  zero  otherwise.  Note  that  we  now  write  D  for 
the  density  distribution  p;  D  is  normalized  to  unity. 

This  is  the  only  probability  assumption  which  one  makes,  and  one 
can  argue  that  it  is  the  simplest  assumption  which  is  consistent  with 
our  macroscopic  knowledge.  From  the  Liouville  equation  one  then 
can  find  in  principle  D{P,t).  It  is  the  character  of  the  change 
D(P,0)  *  D(P,t)  which  determines  everything.  Gibbs  in  the 
notorious  Chapter  12  of  his  book  describes  it  about  as  follows.  The 
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volume  of  the  region  where  D  ¥=  0,  remains  according  to  Liouville 
the  same,  but  the  shape  will  change  drastically.  Especially  because  of 
the  sharp  intermolecular  forces  two  Apoints  which  are  originally 
close  together  will  soon  be  far  apart.  As  a  result  the  volume  of  the 


Figure  4 


initial  region  will  be  drawn  out  to  a  very  long  and  thin  thread  which 
more  and  more  will  wind  through  the  whole  energy  shell  (see  Fig.  4; 
note  that  one  should  always  think  of  the  energy  shell  of  points  such 
that  E  ^  H  S  E  +  AE,  so  that  the  thread  is  really  a  thin  ribbon). 
One  can  expect  therefore: 

1.  In  the  course  of  time  the  probability  distribution  D  will  become 
more  and  more  uniform  over  the  energy  shell.  Of  course  this  is 
meant  only  in  the  coarse-grained  sense;  the  distribution  will  always 
consist  of  a  thin  filament  more  or  less  uniformly  wound  through  the 
energy  shell.  The  distribution  which  is  uniform  within  the  energy 
shell  we  will  call  with  Gibbs  the  mkrocanonical  ensemble.  . 

2.  This  approach  to  uniformity  will  continue  when  t  increases, 
although  each  Apoint  will  move  quasi-periodieally  according  to  the 
Poincare  theorem. 

3.  The  time  it  takes  until  uniformity  is  reached  will  depend  essentially 
on  how  one  judges  the  uniformity,  that  is  io  say  on  the  size  of  the  finite 
regions  in  which  the  energy  shell  is  divided  according  to  the  chosen 
macroscopic  description. 


12 


THF  EXPLANATION  OF  THE  LAWS  OF  THERMODYNAMICS 


4.  For  any  macroscopic  description  there  comes  a  time  when  the 
probability  of  each  macroscopic  state  is  determined  by  the  size  of  the 
corresponding  region  of  the  energy  shell.  Since  for  large  N,  there  is 
one  region  which  is  overwhelmingly  the  largest  this  will  then  also  be 
the  most  probable  state  and  it  characterizes  the  thermal  equilibrium 
of  the  system. 

I  hope  I  have  made  clear  the  similarity  between  the  Boltzmann 
picture  and  the  more  general  argument  of  Gibbs.  Of  course  in  both 
cases  a  number  of  statements  were  made  without  proof,  and  they  may 
seem  so  vague  that  they  are  not  amenable  to  proof!  It  is  therefore  of 
great  interest  that  at  least  part  of  the  argument  can  be  formulated 
much  more  precisely  by  means  of  the  Birkhoff  ergodic  theorems.7 
Let  me  remind  you  of  these  theorems: 

(a)  For  a  bounded  mechanical  motion  and  for  any  phase  function 
=  f(P)  which  is  integrable  over  the  energy  surface,  the  time  average : 

(16)  J  =  Lim  ^  f  dtf(P, ) 

r~®  •  Jo 

almost  always  exists.  Here  P ,  denotes  the  phase  point  which  evolves 
from  the  initial  point  P0  by  the  motion.  Furthermore  T  is  independent 
of  the  choice  of  the  initial  point  on  the  trajectory,  but  J  can  vary  with 
the  trajectory. 

I  will  not  try  to  indicate  the  proof  since  it  is  quite  tricky.  All  that 
is  used  is  the  boundedness  of  the  motion  and  the  Liouville  theorem. 
It  is  therefore  certainly  applicable  to  the  Gibbs  macroscopic  variables 

v,. 

(b)  For  a  metrically  transitive  system  the  time  average  (16)  is  inde¬ 
pendent  of  the  trajectory  chosen  and  is  equal  to  the  ensemble  average 

(17)  >v  =  J-  •  jiMAPMH). 

Here  the  integral  goes  over  the  energy  surface  and  a{H)  is  the  dis¬ 
tribution  over  the  energy  surface  which  corresponds  to  the  uniform 
distribution  in  the  energy  shell,  so  that: 


(18) 


"(//)  = 


const. 

I  grad  H  | 


where  the  constant  is  to  be  determined  by  the  normalization  con 
dition: 


J-  •  jrtH)dSl  =  1. 


o{H)  is  called  the  microcanonical  surface  distribution. 
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A  mechanical  system  is  metrically  transitive  if  the  energy  surface 
can  not  be  divided  into  two  finite  regions  such  that  the  orbits  starting 
from  points  in  one  of  the  regions  always  remain  in  that  region.  It  is 
clearly  a  more  precise  formulation  of  the  idea  that  in  course  of  time 
the  path  of  the  T-point  will  fill  the  whole  energy  surface  which  we 
used  in  the  Boltzmann-Gibbs  argument.8  Of  course  it  is  very  difficult 
(and  it  has  not  been  achieved)  to  prove  for  a  given  Hamiltonian  that 
the  motion  is  or  is  not  metrically  transitive,  and  even  examples  which 
can  be  discussed  exactly  are  rare.8  However,  the  advantage  of  a 
precise  formulation  remains.  Note  still  that  in  principle  it  is  not 
necessary  for  the  equality  of  the  time  and  phase  averages  that  the 
mechanical  system  has  a  large  number  of  degrees  of  freedom,  although 
it  seems  more  likely  that  the  system  will  be  metrically  transitive  if 
N  is  large. 

Let  us  now  return  to  the  Boltzmann-Gibbs  picture  of  the  approach 
to  equilibrium.  First,  by  taking  in  (16)  and  (17)  for  f(P)  the  character¬ 
istic  function  of  a  region  A  on  the  energy  surface  (that  is  /(/>)  =  1  if 
P  is  inside  A  and  JXP)  ■=  0  otherwise),  it  follows  that  for  a  metrically 
transitive  system: 


(19) 


Lim 

T  -  * 


HA) 

T 


y(A) 

V 


where  t  is  the  time  the  phase  point  spends  in  A,  and  where  V(A)  is 
the  volume  of  the  part  of  the  energy  shell,  whose  total  volume  is  V, 
which  is  based  on  A.  This  is  clearly  the  precise  formulation  of  the 
assumption  essential  for  the  Boltzmann  picture  that  the  time  t(A)  is 
mainly  determined  by  the  magnitude  of  the  area  of  A.'0 

Also  the  argument  of  Gibbs  can  now  be  expressed  more  precisely. 
The  assumption  that  for  large  N  there  is  one  set  of  values  of  the 
macroscopic  variables  v,  which  correspond  to  by  far  the  largest  part 
of  the  energy  surface,  can  be  replaced  by  the  requirement  that  the 
variables  v,  must  be  normal  variables ."  By  this  we  mean  that  for 
large  N  the  y,  must  have  the  property : 

(20)  !  yf/  -  jv2}12 


or  in  words:  the  fluctuation  of  each  of  the  y\  must  be  very  small 
compared  to  <yr  itself.  Here  the  average  values  are  always  taken  as 
in  (17)  over  the  microcanonical  ensemble.  It  is  then  clear  that  if 
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one  starts  at  f  =  0  with  the  initial  distribution  D(P,  0)  the  expectation 
values  of  the  variables  y,  defined  by : 

(2D  m  =  \  jdafXP)D(p,t) 

will  start  with  the  values  >’,(0)  corresponding  to  D(P,  0)  and  become 
for  /  -►  oo  the  J\(oo)  =  ■' yt>  which  for  a  large  system  are  very  closely 
equal  to  the  equilibrium  values.12  This  is  so,  because  from  the 
ergodic  theorems  and  from  (20)  one  can  conclude  that  for  almost  all 
trajectories  and  for  the  overwhelmingly  largest  time  the  must  have 
constant  values.  Note  still  that  with  the  “Boltzmann”  yt  as  given 
by  (15)  the  f,  are  the  expectation  values  of  the  numbers  n,  in  the  cells 
to,  at  time  /.*  These  are  connected  with  the  so-called  first  distribution 
function  F\(xi,  t)  defined  by; 

(22)  -p  Ffx  „  t)  =  J  ■■■jdx2  ■dxN  D(x1  ■  -xN,  t). 

In  fact  if  one  smoothes  the  n,(r)  over  the  cells  then  the  “histogram” 
becomes  a  smooth  function  fix,  t)dx  where: 

/(.v,  t)  =  J  J dx\  ■  ■  dxN  ^  5(.v  -  xx)D(xx  xN,  t) 

=  y  Ft{X,  t). 

Similarly  to  (22)  one  can  define  the  second  or  pair  distribution  function 
and  in  general  the  .v-tuple  distribution  function  by: 

(23)  F,ixu  x„  t)  ~  V  J  •  •  Jt/.r, , ,  •  •  dxN  D(.v,  •  •  •  xN,  t). 

We  will  come  back  to  these  functions  in  Chapter  VII. 

Let  me  conclude  this  section  with  some  remarks  about  the  bother¬ 
some  notion  of  the  macroscopic  variables  or  of  the  macroscopic 
"observer”,  since  it  has  led  to  so  much  controversy. 

I.  The  notion  of  macroscopic  variable  may  seem  a  bit  vague,  but 
this  cannot  be  helped  as  long  as  the  discussion  is  kept  in  general  terms. 

*  Note  that  in  §3  the  /),  denoted  the  maximizing  set  of  values  of  the  it,,  which  are  the 
equilibrium  values.  For  large  N  they  are  closely  equal  to  what  is  here  denoted  by 

«,( X’ )  or  by  'n,  •. 
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In  Chapter  VII  we  shall  see  that  the  expectation  values  of  the  usual 
macroscopic  variables  (as  stress  tensor,  temperature  distribution,  etc.) 
can  be  found  from  the  first  few  of  the  distribution  functions  (23). 

2.  There  is  an  element  of  arbitrariness  in  the  concept  of  the  macro¬ 
scopic  description  of  the  system,  which  may  seem  objectionable. 
However,  it  is  clear  that  in  principle  the  macroscopic  knowledge  of 
the  state  of  the  system  depends  so  to  say  on  the  zeal  of  the  observer 
and  can  therefore  not  be  defined  in  general.  All  one  can  say  is  that 
the  macroscopic  description  is  a  contracted  description,  which  uses 
much  fewer  variables  than  required  for  the  precise  microscopic  speci¬ 
fication.  Also,  in  practice,  it  usually  is  clear  what  the  macroscopic 
variables  are,  since  they  are  dictated  by  the  experimental  phenomena 
which  one  tries  to  explain.  To  avoid  the  arbitrariness  by  introducing 
the  concept  of  a  probability  distribution  over  all  possible  macroscopic 
observers,13  as  some  authors  have  done,  seems  to  me  only  to  increase 
the  confusion. 

3.  The  arbitrariness  of  the  macroscopic  description  affects  the  time 
required  for  the  expectation  values  >',(/)  to  reach  their  equilibrium 
values  <>-,),  since  the  choice  of  the  y,  determines  the  size  of  the  regions 
in  which  the  energy  shell  is  divided.  One  expresses  this  often  by  saying 
that  the  relaxation  to  equilibrium  depends  on  the  coarse  graining. 
I  think  this  is  surely  the  case,  but  on  the  other  hand  it  is  clear  that  the 
relaxation  to  equilibrium  will  also  depend  on  the  Hamiltonian,  that  is 
on  the  structure  of  the  mechanical  system,  and  it  is  this  dependence  in 
which  one  is  usually  more  interested,  since  the  choice  of  the  y,  is 
fixed  by  the  experimental  situation. 

4.  The  variation  in  time  of  the  y,(t)  in  principle  will  always  depend 
on  the  initial  probability  distribution  D(P,  0)  in  T-space.  This  again 
seems  objectionable,  because  of  the  arbitrariness.  Even  if  one  says 
that  D{P,  0)  should  be  chosen  so  as  to  correspond  with  our  initial 
macroscopic  knowledge  of  the  system,  it  is  clear  that  this  cannot 
possibly  determine  D{P,  0)  completely.  The  answer  is,  I  think,  that 
one  is  only  interested  in  that  part  of  the  relaxation  of  the  y,  to  equili¬ 
brium  which  is  independent  of  the  choice  of  D(P,  0).  One  should 
expect  that  in  a  proper  macroscopic  experiment  this  will  be  the  case 
after  a  short  “  chaotization  period".  Or  one  can  say  that  after  such  a 
short  period  the  temporal  development  of  the  y,  will  be  determined 
by  the  y,  themselves  through  equations  which  are  of  the  first  order  in 
time.  I  call  this  the  requirement  of  macroscopic  causality.  It  is  a 
condition  on  the  choice  of  the  macroscopic  description,  which  clearly 
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can  only  be  fulfilled  in  some  asymptotic  sense.  We  will  come  back 
to  this  in  Chapter  VII. 

5.  The  laws  of  thermodynamics.  We  must  now  discuss  the  connec¬ 
tion  with  or  the  “explanation"  of  the  laws  of  thermodynamics.  In 
the  usual  formulation  of  these  laws,  this  means  that  we  have  to 
explain  the  following  five  basic  facts  and  notions. 

(a)  The  existence  of  thermodynamic  equilibrium  for  a  closed  system. 
This  is  the  so-called  zeroth  law  referred  to  in  §3. 14  Let  me  assume 
that  it  was  sufficiently  elucidated  in  the  previous  sections! 

(b)  The  notion  of  temperature.  In  thermodynamics  the  equili¬ 
brium  is  in  the  first  place  characterized  by  the  temperature.  In  order 
to  represent  the  temperature  by  a  number  one  must  show  besides  the 
existence  of  an  equilibrium  state,  that  the  equilibrium  state  has  the 
transitive  property.  If  system  A  is  in  equilibrium  with  the  systems 
B  and  C  separately  then  B  and  C  are  also  in  equilibrium  between 
themselves.  Here  we  mean  by  equilibrium  of  two  systems  the  fact 
that  the  macroscopic  properties  of  the  systems  will  not  change  when 
the  systems  are  coupled  “weakly”  together.  The  coupling  must 
allow  interchange  of  energy  between  the  systems,  but  the  interaction 
energy  must  be  completely  negligible  compared  to  the  energy  of  each 
of  the  two  systems. 

(c)  The  first  law  of  thermodynamics.  Since  the  molecular  model  is 
always  taken  to  be  a  conservative  mechanical  system,  so  that  the  total 
energy  is  already  conserved,  the  only  question  which  remains  is  that  of  the 
distinction  between  the  notions  of  quantity  of  heat  and  of  external  work. 

(d)  The  second  law  of  thermodynamics  for  reversible  phenomena. 
Knowing  how  the  temperature  T  and  the  quantity  of  heat  Q  have  to 
be  interpreted,  one  must  show  that  for  a  “reversible"  change  of  the 
macroscopic  state  i>Q/T  is  a  perfect  differential  of  a  function  (the 
entropy)  of  the  macroscopic  state  of  the  system. 

(e)  The  second  law  of  thermodynamics  for  irreversible  phenomena. 
This  says  that  in  an  irreversible  or  spontaneous  change  from  one 
equilibrium  state  to  another  (as  for  example  the  equalization  of 
temperature  of  two  bodies  A  and  B ,  when  brought  in  contact)  the 
entropy  always  increases. 

In  the  following  we  will  discuss  these  points  in  succession. 

6.  The  notion  of  temperature.  The  canonical  ensemble.  For  the 

introduction  of  the  notion  of  temperature  we  have  to  discuss  the 
equilibrium  of  the  system  A  under  consideration  when  it  is  weakly 
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coupled  to  a  very  large  “heat  reservoir"  B.  Under  weakly  coupled 
we  mean  that  the  total  Hamiltonian  can  still  be  taken  as: 

(24)  H  =  Ha  +  Hb 

although  A  and  B  interact  and  can  exchange  energy.  We  assume  that 
the  total  system  A  +  B  is  in  equilibrium  in  the  sense  of  the  micro- 
canonical  ensemble.  The  question  is  now  to  find  the  probability 
that  the  system  A  is  in  a  certain  element  drA  of  its  phase  space.  We 
will  prove  that  in  the  limit  that  system  B  is  very  large,  this  probability 
is: 

(25)  DAdrA  =  ce-*“*drA 

where  the  constant  C  is  determined  by  the  normalization  and  where 
P  is  a  parameter  which  depends  on  the  system  B  but  in  the  limit  is 
independent  of  its  size.  The  probability  distribution  (25)  Gibbs 
calls  the  canonical  ensemble. 

There  are  many  versions  of  the  proof  of  this  theorem15  and  it  can 
be  presented  in  various  degrees  of  rigour.  The  following  simple 
proof  fulfills  I  think  all  the  physical  requirements.  Let  V(E)  be  the 
volume  in  T-space  of  the  region  where  H  S  £,  and  call  S(E)  = 
dy/dE.  S(E)  is  then  the  “area”  of  the  energy  surface  H  =  E,  which 
can  also  be  expressed  as: 


*«-J-/ii5rar 

H  K 

The  micro-canonical  distribution  in  /"-space  can  now  be  written  in 
the  form: 


(26)  Dmlcro  €„.(.v,  •  •  -v.v)  dr  =  HHS(E)E)  dr 

using  the  Dirac  8-function.  If  the  two  systems  A  and  B  are  in  equili¬ 
brium  in  the  sense  of  the  micro-canonical  ensemble  with  the  total 
energy  E,  then,  since  the  two  /"-spaces  are  orthogonal  to  each  other, 
one  has  for  the  distribution  in  the  combined  /"-space: 

(27)  d  drA  dr„  =  —■  S(HA  +  //„-£)  drA  drB 


where: 
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(28)  S(E)  =  j‘  de  Sa(e)Sb(E  -  e) 

as  follows  immediately  from  the  normalization.  The  distribution 
function  for  system  A  irrespective  of  the  state  of  B  is  therefore: 

°A  drA  =  0  J drH  8(HA  +  hb-  E) 

(29)  =W)S  JEb  Sb(Eh)  HHa  +  Hb  ~  E) 

_  S„(E  -  Ha) 

S(E)  “  A ' 

In  most  proofs,  to  make  things  definite,  one  assumes  that  B  consists 
of  a  large  number  of  weakly  coupled  systems,  so  that  in  turn 

H„=  tU  +  H2+  +Hn. 


Physically  this  means  that  one  assumes  the  heat  reservoir  B  to  be  an 
ideal  gas,  and  there  is  clearly  nothing  against  that.  But  then  we  may 
as  well  assume  that  B  is  an  ideal  gas  of  N  point  molecules,  in  which 
case: 


v 


The  energy  surface  then  becomes  a  hypersphere  of  radius  (2 mE)1  2 
in  the  3/V  momentum  directions  and  a  hypercube  of  edge  L  ( V  =  L3  = 
volume  of  the  vessel)  in  the  3 N  coordinate  directions,  so  that  clearly: 

(30)  S„(E)  =  CvE'™112 

indicating  only  the  dependence  on  E.  Now  write  (29),  using  (28)  in 
the  form: 


Sb(E  -  E) 
S„(E) 


From  (30)  and  for  large  N: 


(31) 

with: 


sh(E  —  f)  (E  —  «ya»-n,a 

S„(E)  \  E  j 


>  e  ec 


din  SB  3 N 

-  .  . .. 

dE  =  2  E 


(32) 
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in  which  all  that  is  assumed  is  that  we  are  interested  only  in  values  of 
e  <  E.  Since  DA  is  constant  on  energy  surfaces  HA  =  e  one  gets: 

(33)  Da  drA  =  drA  - 

J  diSA(t)e-tc 

which  is  of  the  form  (25)  and  which  is  automatically  normalized  to 
one,  since: 

j  -  j<irA  Da(H)  =  J"  dt  SA(e)DA(e). 

Note  that  I//3  is  proportional  to  the  average  energy  per  particle  of 
the  heat  reservoir  and  is  in  this  sense  independent  of  the  “size”  of 
the  reservoir. 

Note  that  if  we  had  several  systems  Au  A3-  ■  ■  each  coupled  to  the 
very  large  reservoir  B,  then  ihe  probability  for  each  of  them  to  be  in 
their  region  dT M  is  given  by  the  canonical  distribution  with  the  same 
j3.  The  quantity  0  has  therefore  all  the  required  temperature  properties 
and  it  must  be  an  universal  function  of  the  thermodynamic  tempera¬ 
ture. 

Note,  finally,  that  the  system  A  will  always  be  canonically  distri¬ 
buted,  whatever  its  size.  It  could  consist  of  a  few  or  even  one  mole¬ 
cule.  However,  we  will  only  use  the  theorem  in  the  case  that  A  is  of 
macroscopic  size.  In  this  case  because  of  the  steepness  of  increase 
of  Sa(E)  with  E,  the  canonical  distribution  (33)  will  be  very  narrow 
around  the  average  value,  so  that  then  the  difference  between  the 
canonical  and  the  micro-canonical  distribution  will  be  slight.16 

7.  The  first  law  of  thermodynamics.  Since  both  in  the  first  and  in 
the  second  law  of  thermodynamics  one  considers  changes  of  the 
macroscopic  state  of  the  system,  we  first  need  to  look  more  closely 
at  the  outside  potential  U  in  the  Hamiltonian  (1)  through  which  the 
changes  of  state  are  produced.  In  general  one  can  say  that  this 
potential  U(r„  a2  )  will  depend,  besides  on  the  position  r(  of 
the  ith  molecule,  on  a  number  of  parameters  ak  which  characterize 
the  outside  fields.  Suppose  for  instance  that  the  outside  field  is 
produced  by  m  fixed  and  independent  centres  of  force.  Then: 

m 

U(r „au  a2  •  ■)  =  £  <*\(  |rt  -  Rk|) 

k  *  X 

and  the  parameters  ak  are  the  positional  coordinates  Rk  of  the  centres. 
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The  force  on  the  /th  molecule: 


(34) 


is  equal  and  opposite  to  the  sum  of  the  forces: 


(35) 


ix*  = 


const. 


exerted  by  the  /'th  molecule  on  the  centres  as  required  by  Newton’s 
third  law. 

Although  it  is  always  possible  to  describe  the  outside  force  fields 
in  such  detail  that  the  equality  of  action  and  reaction  is  put  in  evidence, 
it  is  in  general  not  convenient  to  do  so.  Already  if  the  force  centres 
Rk  are  not  independent  but  are  rigidly  connected  to  each  other,  then 
it  is  clearly  indicated  to  describe  the  centres  by  the  six  coordinates  of 
the  rigiy. structure  of  connected  sources.  In  this  case  the  force  Fi 
on  the  ith  molecule  is  still  related  to  the  generalized  force: 


acting  “in  the  direction  ak"  on  the  sources,  but  the  relation  is  more 
complicated  than  in  (34)  and  (35).  In  the  general  case  we  will  there¬ 
fore  not  bother  with  the  relation  between  the  force: 


acting  on  the  /th  molecule  and  the  generalized  force  Xk  in  the  direction 
ak.  The  forces  Xk  will  determine  the  work  done  on  the  gas  for  fixed 
configuration  of  the  molecules  if  one  changes  the  outside  force  fields 
by  changing  the  parameters  ak.  In  fact  this  work  will  be  given  by: 

(37)  51*'  =  -  X,  Sa,  -  X3  S<j2 - 

where  the  equality  of  action  and  reaction  is  used  implicitly. 

Turning  now  to  the  first  law  of  thermodynamics,  I  have  remarked 
already  that  since  conservation  of  energy  is  so  to  say  built  into  the 
theory  the  only  question  remaining  is  how  to  distinguish  between  the 
external  work  W  done  on  the  system  and  the  quantity  of  heat  Q  put 
into  the  system. 

Work  can  only  be  done  on  the  gas  by  changing  the  parameters  ak, 
and  from  (36)  and  (37)  we  see  that  the  work  done  on  the  gas  is: 

8W'  =  ~'£%khak 

k 


(38) 


where: 

(39) 
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is  the  force  in  “direction”  ak  averaged  over  all  the  configurations  of  the 
gas.  If  one  takes  for  D  the  microcanonical  distribution  then  clearly 
bW  must  be  equal  to  the  increase  bE  of  the  energy,  or  in  thermo¬ 
dynamical  language  the  change  b  is  then  an  adiabatic  one.  If,  how¬ 
ever,  one  takes  for  D  the  canonical  distribution,  then  after  the  change 
bak  of  the  parameters  ak  the  original  distribution  will  in  general  not 
be  an  equilibrium  distribution  with  respect  to  the  heat  reservoir. 
A  redistribution  will  take  place,  which  will  change  the  average  energy, 
and  we  will  define  the  quantity  of  heat  bQ  put  into  the  system  by: 

(40)  bQ  =  bE  -  bW. 

That  this  is  appropriate  can  be  seen  also  by  saying  that  the  average 
energy  can  also  be  changed  by  changing  the  P  of  the  heat  reservoir 
without  changing  the  parameters  ak,  and  in  this  case  the  change  should 
clearly  be  called  the  heat  put  into  the  system. 

Note  finally  that  since  usually  the  only  outside  potential  we  consider 
is  the  wall  potential  for  which  there  is  only  one  parameter  ak  namely 
the  volume  V  of  the  vessel,  one  can  write  in  this  case  Eq.  (38) 

W  =  - pbV 

where : 


(41)  />  -/-/««•(-  §)D 

is  clearly  the  pressure  exerted  by  the  gas  which  if  D  is  the  canonical 
ensemble  will  be  a  function  of  V  and  p.  The  macroscopic  state  of  the 
system  is  then  specified  by  V  and  p,  or  in  the  general  case  by  p  and  the 
parameters  ak. 


8.  The  second  law  of  thermodynamics.  We  will  now  show  that  for  a 
change  8  in  which  both  the  p  of  the  heat  reservoir  and  the  parameters 
a, c  are  changed  in  such  a  slow  or  “reversible"  way  that  the  system 
may  always  be  considered  to  be  canonically  distributed,  the  quantity 
pbQ  is  a  perfect  differential  of  a  function  of  the  state  of  the  system, 
that  is  a  function  of  P  and  the  ak. 

To  prove  this,  write  the  average  energy  in  the  form: 


E  = 


jdr  He 

Jdre-11" 


e_ 

ep 


InZ 
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with: 

(42)  Z  =  jdre~tH. 

Hence: 

From  (38)  follows  that  the  work  done  on  the  system  can  be  written  as: 


hw  = 


jdreBH  p 


1  v  8  In  Z  . 

=  --sl-e r-K. 


Therefore: 

P  &Q  =  p(&E  -  &W) 


(43) 


ga  In Z  _  gin. 


"  -  a - toz] w  -  ? s:  [^ - lnZ]  8“* 


which  concludes  the  proof. 

Comparison  with  &Q/T  =  SS  shows  that: 


(44) 


(45) 


S  =  ~kP3  In  zj  +  const. 

=  -^.(kTinZ)  +  const. 
cl 


where  T  is  the  absolute  thermodynamic  temperature,  S  the  entropy 
and  k  an  universal  constant  which  cannot  be  determined  from  this 
argument.  To  find  k,  one  must  consider  the  case  of  an  ideal  gas, 
enclosed  in  the  volume  V.  Then: 


Z~L  "  'Sr*' 

=  Vs(2nmkT)3Nia. 
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From  (41)  this  gives  for  the  pressure: 

(46)  p  =  jp(kT\nZ)  = -y- 

So  one  gets  the  ideal  gas  law  if  k  =  R/N  is  the  gas  constant  per 
molecule. 

I  will  omit  a  more  detailed  discussion  about  the  question  in  which 
precise  sense  the  entropy  S  as  defined  by  (45)  has  also  the  property 
(required  by  the  second  law  for  irreversible  processes)  of  increasing 
in  any  spontaneous  transition  from  one  equilibrium  state  to  another. 
Since  we  have  explained  the  zeroth  law  the  reader  will  believe  that 
this  part  of  the  second  law,  which  is  a  more  precise  expression  of  the 
irreversible  behaviour  of  macroscopic  systems,  can  be  “explained" 
in  a  similar  way. 

Finally  a  remark  about  the  constant  in  the  equation  (45)  for  the 
entropy.  Clearly  from  the  argument  given  it  is  not  possible  to  fix 
this  constant  and  one  cannot  even  determine  its  dependence  on  the 
number  of  particles  N.  Thermodynamically  only  entropy  differences 
between  states  which  can  be  connected  by  a  reversible  transition  are 
operationally  defined.  Therefore  since  N  is  fixed,  the  dependence  of 
the  entropy  on  N  can  only  be  agreed  upon  by  convention.11  How¬ 
ever,  it  certainly  is  a  sensible  convention  to  require  that  the  entropy 
at  fixed  T and  for  large  N  and  large  V,  so  that  r  =  V/N  is  fixed,  becomes 
of  the  form : 

(47)  S  =  Nr,(c,  T). 

The  entropy  becomes  then  a  so-called  extensive  variable,  that  is  pro¬ 
portional  to  the  size  of  the  system  when  the  intensive  variables  T  and  t> 
are  given.  In  addition  one  also  would  like  to  insure  that  all  the 
classical  results  agree  in  the  limit  T  >  oc  with  the  results  of  the 
quantum  statistics,  if  one  makes  the  usual  assumption  that  each  non¬ 
degenerate  energy  level  has  the  statistical  weight  equal  to  one.  Note 
that  this  is  of  course  also  a  convention,  since  only  the  ratios  of  the 
weights  of  different  levels  have  operational  meaning,  but  it  certainly 
is  a  very  appealing  convention.  Both  these  objectives  are  realized  by 
omitting  the  constant  in  (45)  and  by  defining  Z  instead  of  by  (42),  by: 

=  (viW  L  ’ '  ‘ Jv  ■■■**' cxp ( ■ ~  Fr£ M 


(48) 


24  THE  EXPLANATION  OF  THE  LAWS  OF  THERMODYNAMICS 


where  h  is  Planck's  constant,  A  =  hft2nmkT )1/a,  and  where  in  the  second 
line  the  integration  over  the  momenta  has  been  carried  out.  Z  is 
called  the  partition  function.  Comparison  with  the  thermodynamic 
formulas  then  shows  that: 


(49) 


nV,T,  N)  =  -kT\nZ 


is  the  (Helmholtz)  free  energy,  from  which  all  other  macroscopic 
quantities  follow.  Especially: 


(50) 
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II 

ii 
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and  so  on.  In  Chapter  II  we  will  show  that  for  large  N  and  V, 
v  =  VjN  fixed : 


(51) 


V(V,T,N)  =  NKv.  T) 


from  which  (47)  follows. 
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1.  For  a  critical  discussion  of  the  foundations  of  statistical  mechanics 
the  article  of  P.  and  T.  Ehrenfest  ( BegriffUche  Grundlagen  der  Statis- 
tischen  Auffassung  in  der  Mechanik,  Enzyclopadie  der  Math.  Wiss. 
Vol.  IV,  Art.  32,  1912),  is  still  indispensable.  It  is  now  also  available 
in  English  translation  (by  M.  J.  Moravcsik,  Cornell  University  Press, 
1959).  For  a  recent  account  and  for  the  recent  literature  see  for 
instance  A.  Munster,  Prinzipien  der  Slatistischen  Mechanik ,  Handbuch 
der  Physik,  Vol.  III/2,  1959. 

The  difference  in  attitude  with  respect  to  the  foundations  is  for 
instance  illustrated  by  the  fact  that  a  mathematician  (e.g.  Khinchin, 
see  his  book:  Mathematical  Foundations  of  Statistical  Mechanics, 
translated  by  G.  Gamov,  Dover  reprint,  1949),  is  inclined  to  look 
upon  the  ergodic  theorems  of  Birkhoff  e.o.  as  the  proper  basis  for 
the  theory,  while  a  physicist  is  often  inclined  to  think  that  these 
theorems  are  either  almost  obvious  or  not  really  essential.  This  is  for 
instance  the  view  expressed  by  Landau  and  Lifshitz  in  their  excellent 
book  Statistical  Physics  (Pergamon  Press,  London,  1958).  These 
authors  start  from  the  beginning  not  with  a  closed  system,  but  with  a 
system  interacting  with  a  large  heat  reservoir.  Other  physicists  (e.g. 
Tolman  in  his  well  known  book,  The  Principles  of  Statistical  Mechanics 
(Oxford,  1938))  more  or  less  postulate  the  microcanonical  ensemble  as 
the  representation  of  a  closed  system  in  thermal  equilibrium.  It  is 
true  that  in  either  of  these  two  ways  one  can  avoid  the  vexing  problem 
of  the  approach  to  equilibrium  while  one  keeps  all  that  is  needed  for 
the  treatment  of  the  problems  of  the  equilibrium  theory.  However, 
it  seems  to  us  that  the  goal  should  be  an  unified  treatment  of  both 
equilibrium  and  non-equilibrium  statistical  mechanics.  To  do  this, 
one  cannot  avoid  the  ergodic  problem,  and  the  Birkhoff  ergodic 
theorems  then  become  an  important  first  step,  which  however  does  not 
exhaust  the  subject! 

2.  The  assumptions  of  a  hard  core  and  of  a  finite  range  are  only 
made  in  order  to  have  a  definite  mathematical  model  for  the  inter- 
molecular  potential.  From  the  quantum  theory  of  the  van  der 
Waals’  forces  between  neutral  molecules  it  only  follows  that  at  short 
distances  one  has  a  very  sharp  repulsion  while  at  larger  distances 
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there  is  an  attraction  which  goes  to  zero  as  \/r8.  Since  the  theory 
does  not  give  an  explicit  expression  for  <f>(r)  it  is  customary  to  use 
simple  semi-empirical  formulas,  as  for  instance  the  so-called  Lennard- 
Jones  ( 1 2,  6)  potential : 


which  represents  the  two  features  of  <f>(r)  which  the  quantum  theory 
predicts.  We  will  come  back  to  this  in  Chapter  II. 

3.  The  theorem  of  Liouville  proves  that  the  volume  in  phase-space 
is  a  so-called  integral  invariant  of  the  motion.  It  is  well  known  that 
the  volume  is  only  one  of  a  series  of  such  integral  invariants  which 
were  discovered  by  Poincare.  One  can  prove  that  for  any  ensemble 
flow  and  for  any  s  such  that  I  ^  s  ^  n: 

<b)  J  [  2  d<lu  dPh  In  dPh  d(iJPi.  =  0. 

In  here  the  integral  goes  over  a  25-dimensional  surface  and  the  sum 
goes  over  all  „C,  different  combinations  of  the  indices  /'i  •  •  •/,.  Eq.  (b) 
states  that  the  integral  does  not  change  if  one  moves  with  the  fluid. 
For  5  =  n  Eq.  (b)  becomes  the  Liouville  theorem  in  the  Lagrangian 
form.  Compare  E.  T.  Whittaker,  Analytical  Dynamics ,  4th  ed. 
(Cambridge  University  Press,  Cambridge,  I960),  Chapter  10,  or  for  a 
more  succinct  discussion  H.  Goldstein,  Classical  Mechanics  (Addison- 
Wesley  Press,  Cambridge,  Massachusetts,  1950),  pp.  247-250. 

One  might  well  ask  the  reason  why  the  particular  case  of  (b)  repre¬ 
sented  by  the  Liouville  theorem  plays  such  a  special  role  in  statistical 
mechanics.  We  think  the  simple  answer  is  that  the  Liouville  theorem 
is  the  only  invariance  theorem  which  affects  the  change  of  the  density 
p.  Only  for  singular  density  distributions,  in  which  the  members  of 
the  ensemble  would  be  distributed  over  surfaces  of  dimensionality 
less  than  2 n,  would  the  more  general  invariance  properties  (b)  be  of 
importance. 

4.  Proof  of  the  Poincare  recurrence  theorem.  We  follow  the 
presentation  by  M.  Kac  in  the  Boulder  lectures  of  1957  ( Probability 
and  related  topics  in  physical  sciences ,  p.  63).  Let  A  be  the  region  on 
the  energysurface  H  =  E  around  the  initial  point  P  (see  Fig.  2).  If 
we  follow  all  the  points  of  A  in  their  motion,  then  it  is  easy  to  show 
that: 
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remains  the  same.  To  see  this,  consider  two  energysurfaces  H  —  E 
and  H  -  E  +  AE.  If  dn  is  the  normal  distance  between  these  two 
surfaces,  then  according  to  Liouville  the  cylindrical  volume: 

J-J"* 

will  be  conserved.  Since  AE  =  |grad  H\An  and  AE  is  fixed  this 
implies  that  (c)  remains  constant.  Call  p(A)  the  measure  of  the  region 

A.  This  is  therefore  a  positive  and  finite  number  associated  with  A 
which  is  conserved  in  the  streaming  over  the  energysurface.  Since 
the  total  area  of  the  energysurface  is  finite  because  of  the  boundedness 
of  the  motion  also  the  total  measure  of  the  energysurface  will  be  finite. 

Suppose  now  that  there  are  points  in  A  which  in  their  motion  would 
never  come  back  to  A  and  that  there  are  so  many  of  such  points  that 
they  would  fill  a  subregion  B  in  A,  which  has  a  finite  measure.  We 
will  prove  that  this  is  impossible.  Follow  namely  the  region  B  in  its 
motion  and  consider  the  corresponding  regions  Bu  B2,  occupied 
by  the  points  of  B  at  the  later  times  A,  2A  ••  If  A  is  chosen  large 
enough  so  that  B  and  B ,  do  not  overlap  it  follows  that  none  of  the 
regions  B,  Bu  B2,  ■  overlap.  To  see  this  note  that  if  say  Bn  and 

Bn ,  k  had  points  in  common,  then  from  the  unicity  of  the  motion  (the 
impossibility  of  the  crossing  of  two  streamlines)  it  follows  by  tracing 
the  motion  backwards  that  also  B„  ,  and  Bn,k. ,  must  have  points 
in  common.  Continuing  this  argument  one  would  end  up  with  the 
statement  that  B  itself  must  have  points  in  common  with  Bk,  which 
means  that  B  contains  points  which  after  the  time  kA  would  have 
returned  to  A.  But  this  is  contrary  to  the  supposition  that  B  is  a 
region  containing  all  points  which  never  come  back.  If  the  regions 

B,  Bu  B2,  ■  do  not  overlap,  then  since  p(B)  =  p{Bt)  =  p(B2)  -  ■  ■  ■ 

the  total  measure  of  all  these  regions  would  be  infinite,  which  is 
impossible  since  the  measure  of  the  total  energysurface  is  finite. 
Hence  one  concludes  that  the  measure  of  all  points  in  A,  which  will 
never  return  to  A  must  be  zero,  which  is  the  Poincare  theorem. 

5.  The  proof  can  be  given  with  various  degrees  of  precision.  The 
simplest  way,  following  Boltzmann  and  Tolman  (Principles  of  Statis¬ 
tical  Mechanics,  p.  79)  is  to  take  the  logarithm  of  W  and  to  apply  the 
Stirling  approximation  in  the  weak  form: 

(d)  log  m!  =  n  log  n  -  n. 

One  then  gets: 

(e)  log  W  =  /V  log  N  -  ^  n ,  log  nt  +  ]>  n,  log  wt. 

i  t 
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Varying  the  n,  by  An,,  one  obtains: 


(f)  log  W^W  =  2]  An,(  -  log  n,  -  1  +  log  co()  -  +  ••• . 

The  An,  have  to  fulfill  the  subsidiary  conditions: 

2  An,  =0,  ^  ei  ^n>  ~  0 

i  i 


because  N  and  £  are  given.  Multiplying  these  conditions  with  the 
Lagrange  multipliers  (log  >4  +  1)  and  (-/ 3)  respectively,  and  adding 
the  result  to  (f),  one  sees  that  the  first  variation  vanishes  for  the  set 
fulfilling  the  conditions: 


or: 


-  log  w,  +  log  A  +  log  uj,  -  /3c,  =  0 


ti,  =  Am,  e  6c<. 


That  for  this  set  W  is  a  very  sharp  maximum  follows  from  (f)  since  for 
this  set: 


(g) 


log 


W  4-  AW 

W 


(d«,/n,)2>av 


if  W  is  the  value  of  W  for  n ,  -  n,.  Clearly  if  N  is  very  large  then 
even  a  very  small  average  relative  deviation  'An,ln,>  from  the  n,  will 
reduce  the  corresponding  volume  enormously. 

From  this  proof  we  see  that  a  more  correct  statement  of  the  theorem 
is  that  the  M.B.  distribution,  together  with  those  states  for  which 


<  I 


occupy  almost  all  of  the  energy  shell. 

6.  Requirement  (a)  expresses  the  property  that  any  macroscopic 
description  must  have,  namely  that  it  must  be  insensitive  to  small 
changes  in  the  position  of  the  phase  point  P.  That  such  a  “coarse 
graining"  of  the  T-space  is  necessary  for  the  whole  argument  was 
especially  emphasized  by  P.  and  T.  Ehrenfest  (l.c.  Note  1).  One 
may  say  that  this  coarse  graining  depends  on  the  observer  through  his 
choice  of  the  macroscopic  variables.  On  the  other  hand  (b)  can  be 
looked  upon  as  the  requirement  imposed  by  the  system  (mainly 
because  of  the  fact  that  N  is  very  large)  on  the  choice  of  its  macro¬ 
scopic  description. 
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7.  See  Khinchin,  Statistical  Mechanics ,  Chapter  II,  §5,  or  E.  Hopf, 
Ergodentheorie  (Berlin,  1937),  or  P.  R.  Halmos,  Lectures  on  ergodic 
theory  (Publications  of  the  Mathematical  Society  of  Japan,  no.  3, 
1956),  also  for  the  further  mathematical  literature.  In  the  following 
the  word  “almost"  means  as  usual  “except  possibly  for  a  set  of 
measure  zero”. 

8.  In  the  physical  literature  the  same  idea  is  expressed  by  saying 
that  the  system  must  be  quasi-ergodic.  (See  P.  and  T.  Ehrenfest, 
l.c.,  also  for  the  discussion  and  criticism  of  the  older  ergodic  hypo¬ 
thesis  of  Boltzmann.)  By  this  one  means  that  the  trajectory  starting 
from  almost  any  point  P  will  cross  in  the  course  of  time  any  region 
around  any  point  Q  of  the  energysurface.  A  metrically  transitive 
system  is  obviously  quasi-ergodic,  but  the  inverse  is  not  necessarily 
true.  It  may  be  that  the  energysurface  can  be  divided  in  two  sets 
both  of  finite  measure  and  so  interwoven  that  the  points  in  each  set 
are  everywhere  dense  on  the  surface.  If  the  orbits  starting  from  points 
in  one  set  stay  in  that  set  then  the  system  is  clearly  not  metrically 
transitive  although  it  is  obviously  quasi-ergodic.  From  the  quasi- 
ergodic  property  Eq.  (19)  cannot  be  deduced.  This  was  attempted 
by  A.  Rosenthal  (Ann.  Physik  43(1914)  894)and  the  error  was  pointed 
out  by  A.  Melamit  (Acta  Phys.  Polon.  1  (1932)  281). 

9.  The  best  known  example  is  the  motion  of  a  mass  point  on  a 
surface  of  constant  negative  curvature.  That  the  flow  defined  by  the 
geodesics  on  such  a  surface  is  metrically  transitive  was  first  proved  by 
G.  A.  Hedlund  in  1934  (Ann.  of  Math.  35  (1934)  787).  Compare 
also  E.  Hopf,  Ergodentheorie,  Chapter  V,  and  for  a  simplified  proof 
G.  A.  Hedlund,  Amer.  J.  Math.  62  (1940)  233. 

There  is  also  a  paper  by  Fermi  (Physik.  Z.  24  (1923)  261)  in  which  a 
proof  is  presented  of  the  quasi  ergodicity  of  so-called  canonical  normal 
systems  with  more  than  two  degrees  of  freedom.  These  systems  were 
considered  by  Poincare  (Les  Methodes  Nouvelles  de  la  Mecanique 
Celeste,  Vol.  1,  Chapter  5)  and  they  are  essentially  non-degcncratc 
multiply  periodic  systems  which  are  perturbed  by  a  potential  which  is 
periodic  in  the  angle  variables.  They  include  therefore  a  wide  class 
of  mechanical  systems,  and  it  would  be  of  great  interest  to  know 
whether  Fermi’s  argument  could  be  made  into  a  rigorous  proof  of 
the  metrical  transitivity  of  these  systems. 

10.  Eq.  (19)  is  related  to  a  theorem  proved  by  M.  Kac  (Bull.  Amer. 
Math.  Soc.  53  (1947)  1002)  about  the  average  recurrence  time.  Con¬ 
sider  again  a  discrete  set  of  time  points  d,  2d, • • •  as  in  the  proof  of 


30  THE  EXPLANATION  OF  THE  LAWS  OF  THERMODYNAMICS 

the  Poincare  theorem  (Note  4)  and  an  area  A  on  the  energysurface. 
Let  n (P)A  be  the  first  time  for  a  point  P  in  A  to  return  to  A,  so  that 
P%  c  A.  Then  for  a  metrically  transitive  system  Kac  proves  that  the 
average  recurrence  time  defined  by: 

A 

is  determined  only  by  the  area  of  A  and  in  fact  is  equal  to  ( V/  V(A))  A. 
The  smaller  the  area  A  is,  the  longer  the  average  recurrence  time  will 
be,  and  the  shorter  (according  to  Eq.  (19))  the  point  will  spend  in 
the  area  A. 

11.  The  notion  of  a  normal  property  of  a  system  was  introduced  by 
Jeans  ( Dynamical  Theory  of  Gases,  3rd  ed.,  1921,  p.  74),  and  was 
further  discussed  by  Fowler  ( Statistical  Mechanics,  Cambridge,  1929, 
§  1 .4,  p.  8).  We  think  that  the  definition  used  in  the  text  expresses  more 
precisely  what  these  authors  had  in  mind. 

12.  This  can  also  be  expressed  in  the  following  way.  Since  D(P,  0) 
corresponds  to  a  given  set  of  values  v}0’  of  the  macroscopic  variables 
at  /  =  0,  one  finds  by  integrating  CAP,  t)  over  the  different  regions  on 
the  energy  shell  corresponding  to  the  different  sets  of  values  of  the 
y„  the  conditional  probability  P(  v'(0,|y„  t)  that  at  time  t  the  macro¬ 
scopic  variables  have  the  values  y,  if  at  t  =  0  they  have  the  values 
yi°\  For  /  >  oo,  />(.ii°'|y„  t)  >  M.vt,  JV  ).  where  W()’n  y3  •  )  is 
determiued  by  the  volume  of  the  energy  shell  corresponding  to  the 
set  of  values  y,,  y2  •  .  Because  for  large  I V  there  is  one  set  of  values 
of  the  y,  corresponding  to  an  overwhelmingly  largest  volume,  the 
function  fV(yt ,y2-  •  •)  will  be  very  sharply  peaked  at  this  equilibrium 
set  of  values  of  the  y,. 

Formulated  in  this  way  the  theory  becomes  completely  similar  to 
the  theory  of  stationary  stochastic  processes. 

13.  This  was  done  first  by  J.  von  Neumann  (Z.  Physik.  57  (1929) 
80).  For  a  discussion  and  criticism  of  this  concept  see  the  article  by 
M.  Fierz  in  the  Memorial  Volume  to  Wolfgang  Pauli  (Interscience 
Publishers,  New  York,  I960). 

14.  It  should  be  pointed  out  that  this  terminology  is  not  generally 
used.  For  instance  R.  H.  Fowler  and  E.  A.  Guggenheim  ( Statistical 
Thermodynamics,  Cambridge,  1939,  p.  56)  call  the  postulate  of  the 
existence  of  temperature  the  zeroth  law  of  thermodynamics.  Since  the 
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notion  of  temperature  depends,  besides  on  the  existence  of  thermo¬ 
dynamic  equilibrium,  on  the  transitive  property  of  the  equilibrium  state, 
and  since  for  the  statistical  interpretation  of  the  notion  of  temperature 
one  needs  the  canonical  ensemble,  it  seems  to  us  preferable  to  reserve 
the  term  zeroth  law  to  the  fundamental  fact  of  the  approach  to  equili¬ 
brium  of  a  closed  system. 

15.  See  for  instance:  Boltzmann,  Wissenschaftliche  Abhandlungen, 
Vol.  1,  p.  284;  G.  Krutkow,  Z.  Physik.  81  (1933)  377;  Khinchin, 
Statistical  Mechanics,  Chapter  5,  §19;  H.  Grad,  Comm.  Pure  Appl. 
Math.  5(1952)  455. 

16.  It  seems  very  likely  therefore  that  for  a  system  with  a  large 
number  of  molecules  the  macroscopic  properties  of  the  equilibrium 
state  can  be  obtained  either  from  the  canonical  or  from  the  micro- 
canonical  ensemble  and  that  the  results  will  always  be  the  same.  From 
the  point  of  view  of  thermodynamics  the  difference  between  the  two 
ensembles  lies  only  in  the  choice  of  the  basic  thermodynamic  character¬ 
istic  function  and  in  the  corresponding  choice  of  the  macroscopic 
variables.  It  is  difficult  though  to  prove  this  rigorously  and  in  all 
generality,  so  that  it  should  be  verified  for  any  specific  problem. 

17.  This  was  pointed  out  clearly  in  the  basic  paper  by  P.  Ehrenfest 
and  V.  Trkal  (Ann.  Physik.  65  (1921)  609).  The  neglect  of  this  point 
has  led  to  a  great  deal  of  discussion  and  to  the  so-called  " N\"  con¬ 
troversy.  See  for  instance  E.  Schrbdinger,  Statistical  Thermodynamics 
(Cambridge  University  Press,  1946)  and  O.  Stern,  Revs.  Modern 
Phys.  11  (1949)  534. 


Chapter  II 


Theory  of  the  Non-Ideal  Gas 


1.  The  general  problem.  We  have  seen  that  by  computing  the  parti¬ 
tion  function  Z(  V,  T,  N)  we  should  in  principle  be  able  to  calculate 
all  the  macroscopic  equilibrium  properties  of  the  system  when  the 
intermolecular  forces  are  known.  Looking  now  at  the  experimental 


Figure  I 

facts,  this  implies  that  we  should  be  able  to  explain  the  following 
phenomena : 

(a)  The  deviations  from  the  ideal  gas  laws  which  occur  when  at  fixed 
temperature  the  volume  V  is  no  longer  very  large  compared  to  Nr\ 
(r,  =  range  of  the  intermolecular  force). 

(b)  The  condensation  phenomenon.  At  low  enough  temperatures 
any  gas  starts  to  condense  at  a  critical  density  \/vu  which  shows  itself 
in  the  isotherm  by  a  sharp  discontinuity  in  the  slope  (see  Fig.  1).  For 
v  <  r,  the  pressure  stays  strictly  constant  (vapour-liquid  equilibrium) 
till  at  v2  one  has  only  the  liquid  phase.  Here  there  is  a  second  sharp 
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discontinuity  in  the  slope  of  the  isotherm,  and  for  v  <  v2  the  isotherm 
rises  steeply  corresponding  to  the  small  compressibility  of  all 
liquids. 

(c)  The  existence  of  a  critical  temperature  Tcrit.  Again  for  all  sub¬ 
stances  the  condensation  phenomenon  only  occurs  for  T  <  Tcr ,t.  If 
T  approaches  Tcrn  the  horizontal  portion  of  the  isotherm  becomes 
shorter  and  it  disappears  in  the  critical  point  C.  For  T  >  Tcril  there 
is  no  longer  a  discontinuity  in  the  isotherm. 

There  are  other  general  phenomena.  At  still  smaller  volumes  and 
probably  at  any  temperature  the  substance  solidifies,  and  one  has  the 
corresponding  solid-liquid  and  solid-vapour  equilibria.1  But  the 
explanation  of  these  phenomena  from  the  basic  integral  (/,  48)  for 
Z(V,T,N)  is  still  so  far  from  being  accomplished  that  I  will  not 
bother  the  reader  with  other  experimental  facts.  Before  giving  a 
summary  of  what  has  been  accomplished,  let  us  look  at  the  historical 
background. 


2.  The  theory  of  van  der  Waals.  The  first  great  advance  in  the 
understanding  of  the  properties  of  gases  and  liquids  was  made  by 
van  der  Waals  in  his  famous  Leiden  dissertation  of  1873.  Van  der 
Waals  tried  to  take  into  account  the  effect  of  the  intermolecular 
force  (attraction  at  large  distance,  sharp  repulsion  at  short  distance) 
on  the  equation  of  state  of  the  gas  and  he  arrived  at  the  famous 
equation: 


(I)  (p  +  -p-2)(F  -  h)  =  NkT 

where  a  is  a  measure  of  the  attractive,  h  of  the  repulsive  forces.2  It 
represents  qualitatively  all  known  properties  of  the  gas  and  liquid 
phase  and  especially  it  "explains’'  the  existence  of  two  phases  for 
T  <  7*,r„  and  the  existence  of  a  critical  point.  This  is  shown  in  the 
following  way.  Since  (I)  is  of  the  third  degree  in  V ,  one  easily  sees 
that  for  low  enough  temperature  the  isotherm  is  not  monotomic  but 
shows  a  wiggle  (see  Fig.  2).  Between  the  points  A  and  B  this  must 
represent  an  unstable  state  and  hence  van  der  Waals  says  the  system 
will  split  in  two  phases.  One  now  needs  a  thermodynamic  argument 
(Maxwell  rule  or  equal  area  rule)  to  find  the  volumes  Vu  V2  of  the 
saturated  vapour  and  liquid  and  the  constant  saturated  vapour 
pressure  />,.  The  wiggle  disappears  for  T  >  T,,n.  The  critical  point 
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C  is  fixed  by  the  fact  that  there  the  isotherm  has  an  horizontal  inflexion 
point,  so  that : 


from  which  follows: 

(2)  yc  =  3  b,  pc  =  RTC  =  ~ 


Figure  2 


I  will  not  try  to  review  the  further  remarkable  successes  of  the  van  der 
Waals  equation.  In  fact,  they  were  so  remarkable  that  they  practi¬ 
cally  killed  the  subject  for  more  than  fifty  years!  Let  me  only  still 
mention  that  a  consequence  of  ( I )  is  the  so-called  law  of  corresponding 
states,  which  says  that  if  one  measures  p,  V  and  T  with  the  critical 
quantities  (2)  as  units,  the  resulting  equation  of  state  is  the  same  for 
all  substances.  This  law,  which  was  instrumental  in  the  precise 
predictions  of  Kamerlingh  Onnes  of  all  the  factors  involved  in  the  first 
liquefaction  of  Helium,  follows  from  (1)  since  (I)  involves  only  two 
constants  a  and  b.  The  rather  general  validity  of  the  law  indicates 
that  the  intermolecular  potentials,  say  for  all  inert  gases,  can  all  be 
represented  by  the  same  equation  if  one  only  adjusts  the  units  of 
length  and  energy.  A  good  representation  of  the  potential  (we  will 
soon  see  how  this  can  be  checked)  is  for  instance  the  (12,6)  Lennard- 
Jones  potential:  (see  Fig.  3) 


<£(r)  =  4f 
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From  a  simple  dimensional  argument  it  then  follows  strictly  from 
Z(  V,  T,  N)  that  if  one  measures  all  lengths  in  units  a,  and  the  tempera¬ 
ture  in  units  e/k  the  reduced  equation  of  state  should  be  universal.3 


In  turn  this  means  that  the  ratios  VJNo3  and  kTJ f  should  be  universal. 
In  fact  for  the  inert  gases  one  finds: 


c  ~  I  SO 
2nNo3/i  = 


which  allows  a  quick  estimate  of  the  strength  and  range  of  the  inter- 
molecular  potential  from  the  critical  quantities. 

To  come  now  to  the  criticism  of  the  van  der  Waals  equation,  the 
point  which  shows  most  clearly  that  the  van  der  Waals  equation  can 
not  be  considered  as  in  any  way  final,  is  that  an  additional  thermo¬ 
dynamic  argument  is  required  to  fix  the  saturated  vapour  pressure. 
An  autonomous  statistical  theory  of  the  non-ideal  gas  and  the  liquid 
state  should  not  require  any  thermodynamics.  Also,  it  is  clear  that 
since  from  Z(V%T,N)  one  should  find  the  state  of  thermodynamic 
equilibrium  it  should  be  impossible  in  any  bonafide  derivation  of  the 
equation  of  state  from  Z  to  find  unstable  or  metastable  states  of  the 
system.  In  fact  we  will  show  in  Chapter  III  that  the  pressure  as 
derived  from  Z  must  be  a  monotonic  non-increasing  function  of  the 
volume.  All  “proofs"  of  the  van  der  Waals  and  similar  equations  of 
state  contain  therefore  necessarily  unallowed  extrapolations  or 
approximations.  Finally  there  is  the  failure  of  the  van  der  Waals 
equation  to  describe  the  phenomena  quantitatively  correct,  although 
it  is  unsurpassed  qualitatively.  Already  in  1901,  this  failure  led 
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Kamerlingh  Onnes  to  abandon  all  closed  expressions  for  the  equation 
of  state  and  to  represent  the  data  by  a  series  expansion  of  the  form: 


(3) 


pv 

RT 


cm 

f/2  +  ’  '  ' 


This  is  called  the  virial  expansion:  B(T )  is  the  second  virial  coefficient, 
C(T)  the  third,  etc.  The  representation  (3)  was  not  only  desperation, 
but  it  contained  the  insight  that  the  successive  deviations  from  the 
ideal  gas  law  will  give  information  about  the  interaction  of  the  mole¬ 
cules  in  pairs,  triples,  etc.  This  has  been  confirmed  by  the  theoretical 
derivation  of  (3)  from  the  partition  function  Z,  first  given  in  all 
generality  by  Ursell  and  Mayer  around  I930.4  It  is  a  part  of  the 
theory  which  is  in  a  really  satisfactory  state. 

3.  The  Mayer  trick  and  the  connection  with  the  theory  of  linear  graphs. 

As  a  first  step  in  the  discussion  of  Z(V,  T,  N)  following  Mayer,  we 
introduce: 

<4)  =  /( |T|  -  r,| )  =  exp  [-  |r,  -  r,| )]  -  1 

and  rewrite  Eq.  (48)  of  Chapter  I  in  the  form: 

(5)  z^r^)  =  iv!ir«(  d  +/;*>• 

The  general  form  of  the  function  /(r)  is  shown  in  Fig.  4.  Since  ff 
vanishes  when  |r,  -  r,|  is  greater  than  the  range  rx  of  the  forces,  an 
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expansion  of  the  product  in  (3)  will  lead  to  an  expansion  of  Z  in 
powers  of  V.  To  see  this  more  in  detail,  represent  each  term  in  the 
expansion  of  the  product  by  a  linear  graph  in  which  each  particle  is 
indicated  by  a  point  and  each  factor  J\,  by  a  line  between  the  points 
i  and  j.  For  instance  for  N  =  3: 


3 


n  a 


+/„)  = 


2 


+  J 12  +  fia 

A  A 


+  /a  3 

•  l 


(6) 


+  fitfaa  +  f\  2/1 3  +  f  13/33  +  fi  2/23/31 

A  A  A  A 

232323  23 

where  we  have  drawn  the  graph  under  each  term.  More  symbolically 
for  N  =  4 : 


n,"-A  >1  >1  I  +  rAA 


(I)  (6)  (3)  (12)  (4) 


(7) 


(12)  (4)  (12)  (3)  (6)  (1) 


where  the  numbers  indicate  the  number  of  terms  represented  by  each 
graph  if  one  labels  the  four  points. 

Each  term  in  the  expansion  of  (5)  corresponds  therefore  to  a 
particular  graph  of  N  labelled  points,  which  in  general  will  consist  of 
a  number  of  disjoint  connected  parts.  In  fact  all  labelled  N-point 
graphs  from  separate  points  to  the  complete  graph  with  N(N  -  l)/2 
lines  will  appear.  Now  it  is  clear  that  each  term  in  the  expansion 
will  lead  to  a  product  of  integrals,  one  for  each  connected  part  of  the 
corresponding  graph.  For  example,  a  point  will  give  the  integral: 
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a  line  (/,/')  the  integral: 


jv  jv  *t 

a  triangle  (i.j,  k)  the  integral: 

(8)  j  j  j  dttdt)  dfkfifkftk 

and  so  on.  We  will  call  a  connected  graph  also  sometimes  a  cluster 
and  the  corresponding  integral  like  (8)  a  cluster  integral.  The  advan¬ 
tage  of  such  an  expansion  of  the  integrand  in  (S)  is  that  because  of  the 
short  range  of  the  forces  (beyond  which  the  f,  are  zero),  the  leading 
term  of  each  cluster  integral  is  proportional  to  V.  It  is  this  feature 
which  has  allowed  Ursell  and  Mayer  to  derive  the  virial  expansion  (3) 
and  to  find  explicit  expressions  for  the  virial  coefficients  in  terms  of 
the  functions 

4.  Some  notions  of  the  theory  of  linear  graphs.  Since  we  need  some 
of  the  simple  notions  of  the  theory  of  graphs,  and  since  part  of  the 
terminology  I  am  used  to  is  semi-private,  let  me  collect  here  all  that 
we  will  need.5 

A  linear  graph  is  a  collection  of  points  with  lines  between  certain 
pairs  of  points.  In  general  there  may  be  more  than  one  line  connect¬ 
ing  two  points  and  also  loops  may  occur.  If  any  two  points  are 
connected  by  at  most  one  line  and  if  there  are  no  loops  we  call  the 
graph  simple  (see  Fig.  5).  We  will  mainly  have  to  deal  with  such 
graphs.  A  subset  of  points  which  are  joined  successively  by  lines  is 
called  a  path  connecting  the  initial  and  final  point.  If  the  final  point 
coincides  with  the  initial  point  we  speak  of  a  cycle.  A  connected 
graph  is  one  in  which  there  is  at  least  one  path  between  any  two 
points.  Otherwise  the  graph  is  disconnected.  It  is  clearly  sufficient 
to  study  connected  graphs.  An  articulation  point  is  a  point  where  a 


Figure  5.  (a)  simple  graphs;  (b)  not  simple  graphs 
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graph  can  be  cut  into  two  or  more  disconnected  parts.  A  graph 
without  articulation  points  I  call  a  star  (see  Fig.  6).  A  general  con¬ 
nected  graph  can  be  considered  as  built  up  out  of  the  constituent 
stars  hung  together  at  the  articulation  points.  If  the  constituent 
stars  are  just  single  lines  we  call  the  graph  a  Cayley  tree.  It  can 
also  be  characterized  as  a  connected  graph  without  cycles.  If  the 
stars  are  polygons  we  will  speak  of  a  Husimi  tree.  It  is  a  connected 
graph  in  which  each  line  belongs  to  at  most  one  cycle  (see  Fig.  7). 
If  more  general  types  of  stars  are  used  we  will  speak  of  star  trees.  A 
pure  star  tree  is  one  which  consists  of  only  one  type  of  star;  otherwise 
the  star  tree  is  mixed.  Of  course  if  no  restrictions  are  made  on  the 
type  of  stars  then  the  notions  of  star  tree  and  of  connected  graph  are 
synonymous. 

In  the  physical  applications  graphs  usually  appear  as  a  symboliza¬ 
tion  of  the  different  terms  in  a  successive  approximation  method  and 
the  number  of  points  usually  determines  the  order  of  the  approxima¬ 
tion.  In  each  successive  approximation  method  one  can  therefore 
distinguish  two  problems: 

(a)  The  combinatorial  problem:  how  many  “different”  terms  are 
there  in  /ith  order  ?  This  can  then  be  expressed  by  asking  how  many 


Figure  7.  (a)  Cayley  tree,  (b)  Mixed  Husimi  tree,  (c)  Pure  star  tree. 
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graphs  of  n  points  and  of  a  specific  type  (as  determined  by  the  problem) 
there  are. 

(b)  The  integral  problem:  what  is  the  contribution  or  the  weight 
of  each  graph? 

In  the  combinatorial  problem  one  must  distinguish  between  labelled 
graphs  and  free  (or  topological )  graphs.  In  a  labelled  graph  different 
points  are  distinguished  by  some  index.  In  a  free  graph  the  points 
are  regarded  as  not  distinguishable.  Of  fundamental  importance  is 
the  number  of  different  labelled  graphs  corresponding  to  a  given  free 
graph.  To  enumerate  the  possibilities  one  needs  the  notion  of  the 
group  of  a  graph.  This  is  defined  as  the  group  of  automorphisms  of 
the  graph,  that  is  the  group  of  the  one  to  one  correspondences  of  the 
points  of  the  graph  which  leave  the  connections  invariant.  The  group 
can  be  considered  as  a  permutation  group  of  the  points.  For  example, 
for  an  /i-gon  the  group  is  the  dihedral  group  of  order  In,  and  for  a 
so-called  complete  graph  of  n  points  (i.e.,  the  graph  where  all  pairs  of 
points  are  joined  by  lines)  the  group  is  the  symmetric  group  of  degree 
n.  The  order  of  the  group  is  the  symmetry  number  s  of  the  graph. 
If  one  has  a  free  graph  of  p  points  and  symmetry  number  s,  then  there 
are  clearly  p\/s  different  ways  in  which  one  can  label  the  points  so  that 
one  obtains  p\/s  different  labelled  graphs. 

5.  The  first  Mayer  theorem.  We  now  return  to  the  calculation  of 
the  partition  function  Z{  V,T,  N)  as  given  by  (5)  and  we  will  consider 
that  the  product  is  expanded  and  that  each  term  is  symbolized  by  a 
graph  of  N  labelled  points  as  explained  in  §3.  We  also  saw  that  then 
each  term  becomes  a  product  of  factors  each  referring  to  a  disjoint 
connected  part  of  the  graph.  Mayer  showed  that  the  resulting  sum 
of  products  can  be  put  in  a  simple  form.  To  derive  this,  it  is  illumin¬ 
ating  to  consider  Mayer’s  result  as  a  special  case  of  the  following 
general  theorem.6 

Theorem  I.  Consider  the  quantity  FN  defined  by. 

(9)  Fn  =  ^  W'fG*) 

(C„> 

where  the  sum  goes  over  all  graphs  G»  {connected  or  disconnected)  with 
N  labelled  points,  and  where  tV{GN)  is  a  “  weight "  which  is  a  function 
of  the  graph  GN  and  which  is  supposed  to  have  the  properties : 

(a)  W(G»)  is  independent  of  the  labelling  of  the  N  points,  which 
means  that  it  only  depends  on  the  free  graph. 

(b)  W(Gn)  =  1 1  W(C)  where  the  product  goes  over  all  the  disjoint 


THE  FIRST  MAYER  THEOREM  41 


connected  parts  C,  of  GN. 

Introduce  further  the  quantity. 

(10)  f  =  %  me,) 

(C,) 

where  the  sum  runs  over  all  connected  graphs  with  labelled  points.  The 
theorem  then  states  that : 

(11)  1  +  F(x)  =  e,(t) 

where  Fix)  and  fix )  are  the  generating  functions  of  the  quantities  FN 
and  f 1  defined  by. 

(12)  Fix)  =  2  F"  W 

N~  1  '  I  -  1  '• 

The  proof  of  the  theorem  depends  on  the  following  fundamental 
lemma,  which  we  will  use  over  and  over  again,  so  that  we  will  formu¬ 
late  and  prove  it  in  ail  generality. 

Product  Theorem.  Consider  two  arbitrary  ( finite  or  infinite) 
collections  to  and  W  of  free  graphs  and  form  the  product  collection 
to  x  («  which  consists  of  all  possible  pairs  of  graphs  one  from  to  and 
one  from  OS.  Assign  weights  to  each  of  the  graphs  in  to  and  (H,  and 
assign  to  each  graph  of  the  product  collection  the  product  of  the  weights 
of  the  two  graphs  of  which  it  is  formed.  If  now  we  label  the  graphs  in 
to,  ('i  and  to  x  os  in  all  possible  different  ways  and  form  generating 
functions  similar  to  (12)  (the  upper  limit  can  be  finite  or  infinite),  then 
the  theorem  states  that  the  generating  function  for  the  product  collection 
is  the  product  of  the  generating  functions  for  the  two  collections  to  and  W. 

Pr(X)e.  The  total  weight  assigned  to  all  graphs  of  p  points  in 
collection  to  is: 


where  the  sum  goes  over  all  different  free  graphs  of  p  points  in  to  and 
s,  and  ht  are  the  symmetry  number  and  the  weight  of  the  ith  free 
graph.  Similarly  the  total  weight  assigned  to  all  graphs  of  q  points 
in  collection  <«  is: 


For  a  pair  of  graphs  (»,/)  from  to  and  «  the  symmetry  number  is 
clearly  Note  that  this  is  so  even  when  the  two  graphs  i  and  j 
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are  identical,  because  the  operation  which  transposes  the  two  graphs 
must  not  be  considered  as  an  element  of  the  group  of  the  product 
graph.  Since  the  weight  assigned  is  high  it  is  clear  that  the  total 
weight  assigned  to  all  graphs  of  N  points  in  the  product  collection 

to  x  w  is: 


PN 


y  ( P  ±  g)! 

P Sr-*  pW- 


HrG„. 


Hence,  defining  the  generating  functions: 


#/(*>  =  2H'%'  =  IGS 

P  P‘  « 

it  follows  that : 

P(x)  =  H(x)G(x). 

This  proof  makes  clear  why  it  is  necessary  to  define  the  generating 
functions  with  the  factorials  in  the  denominator. 

From  the  product  theorem  the  proof  of  Theorem  I,  that  is  of  Eq. 
(II),  is  immediate.  If  FJx)  is  the  generating  function  for  graphs  of 
m  disjoint  parts  then : 

(13)  F(x)  =  £  FJx) 

m  »  1 

and  from  the  product  theorem  follows  that: 

(14)  FJx)  =  1  [/(*)]* 

ml 


where  the  ml  is  required  because  the  m  parts  are  chosen  from  the 
same  collection  of  connected  graphs,  and  any  permutation  of  the  m 
parts  leads  to  the  same  disjoint  graph.  From  (13)  and  (14)  Eq.  (11) 
follows. 

Returning  after  all  this  to  the  partition  function  Z{V,  T,  N)  it  is 
clear,  if  one  thinks  the  product  expanded,  that  one  can  write: 

(15)  Z(F,T,  N)=  ±  2  W(G^ 

•(Cw> 

with: 

W(Gn)  =  Jv  /  c^Tl'  (^Tn  Wf*1 


(16) 
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where  the  product  is  over  all  pairs  of  points  ( ij )  which  are  connected 
by  a  line  in  GN.  Thus  if  one  forms  the  generating  function: 

(17)  zgr(  y ,  t,z)  =  2  z(  y,  t,  NWz)" 

Nrn  o 

then  (except  for  the  A3,  introduced  to  cancel  the  A3"  in  (16))  Zfr  is 
clearly  similar  to  I  +  F(x)  if  by  convention  one  puts  Z(V,  T,  0)  =  1. 
The  function  Ztr(  K,  T,  z)  is  called  the  grand  canonical  partition 
function  to  distinguish  it  from  the  ordinary  or  canonical  partition 
function  Z(  y,  T,  N ). 

Since  clearly  the  weight  W(GN)  defined  by  (16)  fulfills  both  of  the 
conditions  required  for  the  applicability  of  Theorem  I,  one  can 
express  Z(  y,  T,  z)  in  terms  of  the  corresponding  function  for  connected 
graphs.  Calling  (to  conform  with  the  usual  notation): 

(18)  b,(y,T)  =  ■■■[  *i  "*,  V  n/./ 

where  the  product  goes  over  all  f,  corresponding  to  lines  in  the 
connected  graph  C,  and  the  sum  goes  over  all  connected  graphs  with 
/  labelled  points,  and  calling: 

(19)  x(F,r,z)  =  2  b,(y,T)z‘-, 

1-  1 

then  clearly  F*(  V,  T,  :)  is  similar  to  the  function  f(x)  of  Theorem  I, 
and  hence : 

(20)  Z„(V,T,z)  =  exv{VX(V,T,z)} 
which  is  the  first  Mayer  relation.7 

6.  The  Mayer  equations.  From  (17)  and  (20)  it  follows  that: 

(21)  A ™Z(y,  T,  =  dz  z-"-'  exp{Fx(K,  T,  z)}. 

Now  note  that  because  of  the  hard  core  assumption  for  the  inter- 
molecular  potential  (see  Chapter  1,  §1),  Z(V,T,N)~  0  if  N  > 
y/(4nrol'i),  so  that  Ztr(y,T,z)  is  a  polynomial  in  z  with  positive 
coefficients.  Therefore  *(  F,  T,  z)  will  be  a  monotonic  increasing 
function  of  z  along  the  positive  real  axis.  Since  z~N~l  is  a  sharply 
decreasing  function,  the  integrand  in  (21)  will  have  a  steep  saddlepoint 
at  some  point  z0  on  the  positive  real  axis,  in  the  limit  that  N  and  V 
are  large  such  that  Nv  =  V  with  v  fixed.  In  this  limit  one  can  evaluate 
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(21)  by  the  method  of  steepest  descent.  Call: 

J\z)  =  -  In  z  +  v\(Nv,  T,  z); 
then  one  gets  in  the  usual  way : 

**/<*„> 

(22)  A™Z  =  {2irNJ“(Zo))lla 
where  r0  is  determined  by  f'(z0)  =  0  or: 

(23)  7  =  37  x(M>,  T,  z0). 

Now  note  that  for  large  K  and  fixed  /,  b,(  V,  T )  becomes  independent 
of  V.  This  is  due  to  the  fact  that  the  integrand  in  (18)  consists  of  a 
connected  set  of  factors  /„  so  that,  keeping  say  rj  fixed,  the  integrand 
will  be  zero  outside  a  sphere  around  of  radius  of  order  /  times  the 
range  of  the  forces.  Integrating  finally  over  r,,  one  sees  that  because 
of  the  factor  1  /  V  in  (18)  the  limit : 

(24)  Lim  />,( jVr,  T)  =  h,(T) 

S  •  r 

exists.  Let  us  now  assume  that  the  series  (19)  for  x(M’,  T,  z)  has  a 
finite  radius  of  convergence  which  for  increasing  N  has  a  finite  lower 
bound.  Then  also  the  limit  series: 

« 

(25)  Lim  jrfJVr,  T,  :)  =  x(T.  z)  =  V  b,(T)z‘ 

s  l-i 

will  have  a  finite  radius  of  convergence.  Since  from  (23)  follows  that 
for  increasing  r,  z0  goes  to  zero,  we  can  make  v  so  large  that  z0  is 
inside  the  circle  of  convergence  of  x(T,  :).  One  then  obtains  from 
(22)  that  for  large  TV: 

(26)  l//=  -  kT  In  Z  =  N  [^r.  T)  + 
with: 

(27)  0(r,  T)  =  -kT{vx{T,  z0)  -  In  (A3z0)}. 

This  verifies  the  statement  made  at  the  end  of  Chapter  I  (Eq.  (51)) 
that  in  the  limit  the  free  energy  ¥>  is  proportional  to  N,  so  that  it  is 
an  extensive  variable  as  desired.  In  (27)  z0  is  determined  by: 
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(28) 

ti  ^0  ^0) 

V  CZ  0  . 

or: 

1 

(A) 

For  the  equation  of  state  one  then  obtains: 

P  =  - 

so  by  using  (28): 

(B) 

j =  X(T,  z0)  =  2bt(T)z'0. 

IfWi 
z0  dv) 


The  equations  (A)  and  (B)  are  the  Mayer  equations.  They  determine 
parametrically  the  relation  between  p  and  v,  that  is  the  equation  of 
state.  Note  still  that : 


(29)  /!=</«+/)(  =  kT  In  (A 3z0) 

which  shows  the  thermodynamic  meaning  of  z0;  /t  is  called  the  chemical 
potential. 

Since  A,  s  I  one  gets  from  (A)  that  for  very  large  r,  z0  2  l/t>„ 
and  then  from  (B)  the  ideal  gas  law  follows.  Eliminating  in  this  way 
the  r0  by  successive  approximation  one  obtains  the  Kamerlingh  Onnes 
expansion  (3)  with: 

B(T)  =  -  Nh(T)  =  -  3  f drl2f(rl2) 

<»> 

=  2rrN  J  drr2(\  -  e  v"’kT), 

C{T)  =  N2[-2h,(T)  +  Mb2)2} 


(31) 


—  ~  2  <^13/12/13/23 

a ir2N2  r*  C *  Cn 

- - —  dr ,r?  dr2rl  dO  sin  0/(r,)/(r2) 

+  r\  -  2rir3  cos  d)'2) 


and  so  on.  This  shows  explicitly  how  to  find  the  first  few  virial 
coefficients  in  terms  of  the  intermolecular  potential. 

The  virial  coefficients  and  especially  B(T )  have  been  one  of  the  best 
sources  of  our  knowledge  of  the  intermolecular  or  van  der  Waals’ 
forces.  To  get  some  idea  of  the  temperature  dependence  of  B(T), 
suppose  that  <£(/•)  consists  of  a  hard  core  plus  an  attraction  ~ 


46  THEORY  OF  THE  NON-IDEAL  GAS 


(see  Fig.  8).  Expanding  the  exponential  in  (30)  one  gets  approxi¬ 
mately 

with  b  =  2rr\rl/3,  a  =  2nN2\/3d'i,  which  is  precisely  what  one  would 
obtain  from  the  van  der  Waals  equation.  As  every  consequence 


from  that  equation,  this  equation  represents  qualitatively  the  observed 
dependence  of  B  on  T  (see  Fig.  9).  Unfortunately  B(T )  does  not 
determine  uniquely,  so  that  one  can  only  make  some  Ansatz  for 


Figure  9 

<f>(r)  such  as  the  ( 1 2-6)  Lennard-Jones  potential,  and  then  see  whether  one 
can  adapt  the  constants  f  and  a  so  as  to  reproduce  the  observed  B(T). 
A  great  deal  of  work8  has  been  done  in  this  direction  with  quite 
satisfactory  results,  so  that  at  least  for  the  inert  gases  one  has  a  good 
idea  of  the  shape  and  the  strength  of  the  intermolecular  force  <f>(r). 
With  the  <f>(r)  obtained  from  B(T)  one  then  can  calculate  C(T)  and  the 
check  with  the  experimental  values  is  again  satisfactory.  This  is  one 
of  the  strong  arguments  for  the  assumption  that  the  intermolecular 
forces  are  pair  forces. 
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7.  The  second  Mayer  theorem.  The  elimination  of  z0  between  the 
two  Mayer  equations  (A)  and  (B)  can  be  done  in  general  and  this 
leads  to  a  striking  general  expression  for  the  nth  virial  coefficient 
which  is  also  due  to  Mayer.  It  is  based  on  a  theorem,  which  again 
we  will  first  formulate  and  prove  in  a  more  general  form. 

Theorem  II.  Let  the  weight  W(CP)  associated  with  a  connected  graph 
with  p  labelled  points  have  the  properties : 

(a)  W(CP)  is  independent  of  the  labelling  of  the  points. 

(b)  W(CP)  =  n  where  the  product  goes  over  all  the  stars 

Sm  out  of  which  Cr  is  constructed. 

Let-. 

(32)  f,  =  2  WC,),  rM  -  ^ 

«V  (S„) 


where  the  sums  go  over  all  different  labelled  connected  graphs  of  p 
points,  resp.  stars  of  m  points.  Then  the  theorem  states  that : 


(33) 

where : 

(34) 

(35) 

Proof.  Consider  first,  instead  of  general  connected  graphs,  pure 
star  trees  in  which  the  single  constituent  star  has  q  points,  weight  W 
and  symmetry  number  s.  The  generating  function  S(y)  consists  then 
of  one  term,  namely : 

(36) 

*»  -  -  *?■ 

To  construct  all  possible  star  trees,  it  is  simplest  to  consider  so-called 
rooted  star  trees,  in  which  one  point  (the  root)  is  given  besides  its 
label  a  special  designation,  say  a  color.  We  now  build  up  the  star 
tree  from  this  root.  In  the  rooted  star  tree,  call  the  stars  which  have 
the  root  point  in  common  the  main  leaves.  If  T(: )  is  the  generating 
function  for  the  rooted  star  trees,  then 

TTl 

T(z)  =  J  Tn(z) 

n--0 


(37) 
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where  Tn(z)  is  the  generating  function  for  rooted  star  trees  with  n 
main  leaves,  and  T0(z)  =  z  counts  the  single  point. 

Now  we  can  construct  rooted  star  trees  with  n  main  leaves  by 
selecting  n  rooted  star  trees  with  one  main  leaf  and  hanging  them  all 
together  on  the  root.  From  the  product  theorem  follows: 


since  Ti(z)/z  is  the  generating  function  for  rooted  star  trees  with  one 
main  leaf  in  which  the  root  point  is  not  counted;  the  factor  z  accounts 
for  the  common  root  point  and  the  factor  n\  is  needed  because  the 
n  parts  are  chosen  from  the  same  collection  of  rooted  star  trees  with 
one  main  leaf  and  any  permutation  of  the  n  parts  leads  to  the  same 
star  tree  with  n  main  leaves. 

Next  one  can  construct  a  rooted  star  tree  with  one  main  leaf  by 
hanging  (q  -  I)  general  rooted  star  trees  with  their  roots  on  the 
(q  -  I)  points  of  the  main  leaf  which  are  different  from  the  root. 
Hence  applying  again  the  product  theorem  one  has: 

(39)  r,(r)  =  [r(r)]«  '. 

The  factor  z  accounts  for  the  root  and  W  for  the  weight  of  the  main 
leaf.  The  factor  q  is  needed  since  one  can  choose  any  point  of  the 
single  main  leaf  as  the  root  point.  Finally  the  factor  1/s  accounts 
for  the  fact  that  there  are  s  equivalent  arrangements  of  root  and 
{q  -  I)  rooted  star  trees  on  the  main  leaf  because  of  its  symmetry. 
Combining  (37),  (38)  and  (39)  one  gets: 


and  from  (36)  one  sees  that  in  this  case: 


dS(T) 

s  dT 

The  generalization  to  mixed  star  trees  and  therefore  to  general 
connected  graphs  is  quite  straightforward.  In  building  up  the  rooted 
star  trees  out  of  rooted  star  trees  with  one  main  leaf,  one  must  now 
distinguish  between  the  different  types  of  stars  which  can  be  the  main 
leaf.  Instead  of  (39)  one  gets: 

r,(r)  =  [7W-1 
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where  the  sum  goes  over  all  types  of  stars  and  where  each  term  in  the 
sum  counts  rooted  star  trees  with  one  main  leaf  which  is  a  star  of  the 
ith  type.  Instead  of  (40)  one  gets: 


T(z)  =  z  exp-^2  T"' ' 1  j- 

and  since: 


S(y)  =  ^  r, 

m  -  2 


this  leads  to  (33). 
Finally  the  relation: 


between  the  generating  functions  f(z)  and  T(z)  of  connected  graphs 
and  rooted  connected  graphs  is  obvious  since  with  labelled  graphs  the 
special  designation  of  any  point  as  the  root  point  leads  to  a  different 
rooted  graph,  so  that  since  the  weight  is  independent  of  the  labelling 
the  total  weight  of  the  rooted  connected  graphs  of  p  points  must  be 
Pf,-" 

Let  us  now  return  to  the  Mayer  equations.  The  Mayer  weight 
associated  with  a  connected  graph  CP  of  p  points: 

(41)  W(CP)  =  ■  ■  ■j  dr,  dTp\  \  j„ 


has  for  large  V  (strictly  only  in  the  limit  V  -«•  oo)  the  two  properties 
required  for  the  applicability  of  Theorem  II.  The  first  property  is 
obvious.  To  prove  the  second  property  one  integrates  in  (41)  first 
over  all  the  points  of  the  end  point  stars  in  C’p  except  the  articulation 
points  which  connect  them  with  the  rest  of  the  star  tree.  Since  the 
/„  depend  only  on  the  relative  coordinates  r,  -  r,  the  result  of  the 
integration  gives  for  large  V  the  product  of  the  weights  of  the  end 
point  stars,  which  will  be  independent  of  the  coordinates  of  the  articu¬ 
lation  points.  Continuing  this  procedure  one  clearly  gets  that  W(CP) 
is  the  product  of  the  weights  of  all  the  stars  out  of  which  Cp  is  con¬ 
structed. 

With  (41)  T{z)  becomes  z  d\/dz  which  is  equal  to  1/r  according  to 
Mayer's  equation  (A).  Calling: 

(42>  a-  -  Hi  W=T)i  z,  n  /. 
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where  the  sum  goes  over  ali  stars  with  m  labelled  points,  clearly  the 
total  weight  rm  becomes  ( m  -  l)!/3m-i  and  hence: 


(43) 


=  2  &yv  =  <Hy)- 


Eq.  (34)  therefore  becomes: 


(44) 


I 


=  z0  exp  U 


which  is  the  expression  of  the  second  Mayer  theorem.  One  can  say 
that  in  (44)  by  expressing  2  in  l/r  one  has  inverted  the  first  Mayer 
equation  (A).  From  Mayer's  equation  (B)  one  then  obtains  (calling 
l/r  =  x) 


dz 


x 

z 


(45) 


=  f  eolv)  d(ye  v,v))  =  x  -  f  dy  y<f>'(y) 
Jo  Jo 


r^r 


which  is  the  virial  expansion.  Clearly  the  nth  virial  coefficient  is: 
n  -  I 


(46) 


=  - f  ••  f  2 n ft- 

n\  v-®  r  Ji-  jv  (s  >  s 


The  expressions  (30)  and  (31)  for  the  second  and  third  virial  coefficient 
are  special  cases  of  (46).  For  the  fourth  virial  coefficient  (46)  gives: 


B 4 


stH?  f-'-f*. 1  +  p^[ 


+ 


(3) 


(6) 


(I) 


where  the  numbers  refer  to  the  numbers  of  terms  of  the  types  sym¬ 
bolized  by  the  graphs. 
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Finally  one  should  emphasize  that  such  a  general  expression  for 
the  virial  coefficients  can  only  be  derived  when  the  intermolecuiar 
forces  are  pair  forces  or  have  the  additivity  property.  For  non¬ 
additive  forces  (and  therefore  in  the  quantum  theory!)  the  Mayer 
equations  (A)  and  (B)  are  still  valid  with  the  appropriate  definition 
of  the  blt  but  the  virial  expansion  can  only  be  obtained  by  the  succes¬ 
sive  elimination  of  the  parameter  z0. 

8.  The  convergence  question  and  the  Gaussian  model.  I  mentioned 
in  §6  that  one  assumes  that  the  radius  of  convergence  of  the  series  (19) 
for  x(M>,  7,  z)  has  for  increasing  N  a  finite  lower  bound  which  is  then 
the  radius  of  convergence  of  the  Mayer  series: 

x(T,  z)  =  y  b,(T)z< 

i  - 1 

which  implies  that  also  the  virial  series  has  a  finite  radius  of  con¬ 
vergence.  I  think  that  all  this  is  very  likely  to  be  true  but  a  strict 
proof  does  not  exist.10  It  is  clear  that  everything  will  depend  on  the 
form  of  the  intermolecuiar  force  <Hr).  If  4>(r)  has  a  hard  core,  then 
we  saw  already  that  Ztr(  V,  T,  z)  is  a  polynomial  in  z  with  positive 
coefficients  starting  from  one.  Since: 

x(  V,  T,  z)  =  2  />,(  V.  7 V  =  l  In  Z,r(  V,  T,  z) 

i- 1  v 

clearly  *(  F,  T,  z)  will  have  for  any  V  a  finite  radius  of  convergence, 
which  is  determined  by  the  zero  of  Ztr(z)  closest  to  the  origin.  What 
happens  for  V  -*  oo,  and  also  for  other  molecular  forces  in  which 
say  <f>(r)  ~  r~"  for  small  r  (as  in  the  Lennard-Jones  potential),  is  not 
known  and  seems  to  me  difficult  to  settle  without  having  an  idea 
about  the  behaviour  of  the  b,  for  large  I.  For  this  reason  we  have 
tried  to  investigate  the  asymptotic  behaviour  of  the  b ,  for  a  special 
model,  the  so-called  Gaussian  model ,  and  although  we  have  not  reached 
any  definite  results,  it  seems  to  me  that  the  mathematical  problems 
one  runs  into  are  so  intriguing,  that  they  warrant  a  short  discussion.11 

For  a  purely  repulsive  potential  #/■),  the  Mayer  function  /(r)  will 
always  be  negative  and  will  look  as  shown  in  Fig.  10.  For  a  soft 
repulsion  it  is  then  tempting  to  replace//  by  the  Gaussian  function: 

(47)  .[,  =  -cxp[— «|r,  -  r,|s]. 

Mind  that  this  does  not  correspond  to  a  real  physical  potential,  since 
with  (47)  (ftr)  would  be  temperature  dependent.  But  if  we  are 
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interested  only  in  the  behaviour  of  the  gas  at  fixed  temperature,  (47) 
should  represent  qualitatively  the  effect  of  a  soft  repulsive  potential. 

With  (47)  all  cluster  integrals  can  of  course  be  performed.  It  is 
even  possible  to  do  this  for  a  finite  volume  if  we  put  the  gas  in  a 
Gaussian  vessel,  that  is  if  we  represent  the  effect  of  the  walls  of  the 
vessel  by  a  Gaussian  potential  exp(  -  /Sr2),  corresponding  to  a  “volume” 
(Tr//S)a  2  and  of  course  with  /3  «.  However,  we  have  looked  mainly 

at  the  limit  V  »  cc,  that  is  at  b,(T).  It  is  easy  to  show  that  in  this 
case  the  weight  (41)  becomes: 

(48)  ^(<Tr)  =  (- Dv(~)  l d(C,))-3' 3 

where  k  is  the  number  of  lines  in  Cp  and  where  d(Cr)  is  the  so-called 
graph  complexity.  This  is  defined  in  terms  of  the  graph  matrix 
d„(Cp),  which  is  a  p  by  p  matrix  with  elements  defined  by: 

{-  I  if  the  line  (1,7)  occurs  in  Cp 

U*j) 

0  otherwise 

du  =  degree  of  the  point/  in  C„,  that  is  the  number  of  lines 
incident  on  /. 

It  is  easy  to  see  that  the  determinant  ||</t/||  is  zero  and  that  all  minors 
of  order  (/?  -  I)  are  equal.  Their  common  value  is  the  graph  com¬ 
plexity.  Eq.  (48)  shows  that  the  weight  of  the  graph  becomes  smaller 
in  absolute  value  the  more  complex  the  graph  is,  which  is  clearly  as 
it  should  be.  It  is  also  of  interest  to  note  the  explicit  dependence  of 
1 V(Cr)  on  the  number  of  dimensions. 


THE  CONVERGENCE  QUESTION  53 


Many  properties  of  graph  matrices  are  known,  but  let  me  only 
mention  the  following  two  theorems:12 

(a)  The  graph  complexity  diCP)  is  equal  to  the  number  of  labelled 
Cayley  trees  of  p  points  which  are  subgraphs  of  Cv. 

(b)  The  sum  of  the  complexities  for  all  graphs  Cpk  of  p  labelled 
points  and  k  lines  is  given  by: 


(49) 


P  +  1 


Clearly  if  we  would  know  the  distribution  of  the  values  of  the  graph 
complexity  for  all  connected  graphs  of  p  points  and  k  lines,  then 
one  would  be  able  to  find  an  explicit  expression  for  bp.  In  fact,  if 
nip ,  k,  d )  is  the  number  of  graphs  of  p  points,  k  lines  and  complexity  d, 
then  from  (48)  and  the  definition  of  b„  it  follows  that: 


(50) 
where : 


|  p<p  -  1  )/2 

*p  =  <2 w-'i  2  <-‘>k2 

"•  *«p-  i  d 


nip ,  k,  d) 
d32 


is  the  analogue  of  the  van  der  Waals’  b  for  this  case  (since  bz  =  -  b ) 
and  where  the  sum  over  k  goes  over  all  the  possible  numbers  of  lines 
which  can  occur  in  a  connected  graph  of  p  points.  Of  course,  for 
finite  p  and  k  only  a  discrete  set  of  values  of  d  is  possible  so  that 
nip,  k,  d)  is  a  step  function,  but  if  p  increases  and  for  k  in  the  middle 
of  its  range  actual  numerical  experience  (all  complexities  for  graphs  up 
to  p  -  7  have  been  computed)13  suggests  that  n(p,  k,  d)  becomes  a 
genuine  smooth  distribution  function  which  in  fact  seems  Gaussian 
around  its  average  value.  It  would  be  of  great  interest,  I  think,  if 
these  statements  could  be  proved.  Unfortunately  at  present  all  we 
know  about  the  distribution  are  the  zeroth  and  first  moments,  since: 

2  n(p,  k,  d)  =  Cpk 


where  Cpk,  the  number  of  connected  ip,  k)  graphs  can  be  found  from 

(c).  Note  7,  and: 


d 


k  -  p  +  1 


which  is  a  rewriting  of  (49).  We  have  been  unable  to  compute  higher 
moments.  Already  the  second  would  be  worth  knowing! 
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1.  The  equilibrium  pressures  of  the  solid-vapour  (sublimation), 
solid-liquid  (melting)  and  liquid-vapour  (evaporation)  equilibria  as 
function  of  the  temperature  meet  in  the  triple  point  A  and  divide  the 
(p,  T)  plane  into  three  regions  (see  Fig.  a).  We  saw  that  the  liquid- 
vapour  line  stops  at  the  critical  point.  It  is  therefore  possible,  by 
going  around  the  critical  point,  to  change  the  vapour  into  a  liquid 


without  ever  having  the  two  phases  at  the  same  time.  One  can  make 
a  continuous  transition  from  the  vapour  to  the  liquid  state.  At 
present  it  is  unknown  whether  the  same  is  true  for  the  fluid  and  the 
solid  state,  or  in  other  words  whether  the  melting  line  AB  stops  in  a 
second  critical  point.  It  seems  unlikely,  because,  as  Landau  has 
pointed  out,  a  solid  and  a  fluid  are  qualitatively  different.  A  fluid  is 
just  a  very  dense  gas,  but  in  a  solid  there  is  a  long  range  order  in  the 
positions  of  the  molecules,  and  it  is  difficult  to  imagine  how  this  could 
develop  gradually  from  the  random  arrangement  of  the  molecules  in 
a  gas.  Experimentally  it  is  known  that  in  some  cases  the  melting  line 
goes  to  much  higher  temperatures  than  Tcrlt.  One  can  solidify 
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Helium  by  high  pressures  at  SO°K,  which  is  about  ten  times  the  critical 
temperature. 

2.  The  various  “derivations”  of  the  van  der  Waals  equation  have 
in  common  that  the  effect  of  the  repulsive  forces  (the  “6”  correction) 
and  the  effect  of  the  attractive  forces  (the  “a”  correction)  are  treated 
separately.  One  can  argue  for  instance  as  follows: 

Omitting  the  attractive  forces  and  taking  the  repulsive  force  as 
between  elastic  spheres  of  diameter  d,  the  partition  function  becomes: 

(a)  Z  =  jJpg  0  S(  |r,  -  r,|) 

where 


S(r„) 


(0  if  r„  <  d, 

\l  if  r„  >  d. 


This  means  that  one  has  to  find  the  volume  of  a  3/V  dimensional  cube 
(taking  V  cubical)  from  which  are  excluded  the  cylindrical  regions 
|r,  -  t, |  g  d.  This  is  a  very  difficult  problem!  However,  if  V  >  Nv0 
where  i>0  =  4w</3/3,  then  in  first  approximation  one  can  neglect  the 
overlapping  of  the  action  spheres,  so  that  the  integral  in  (a)  becomes: 


J-ViV-  i-oKK  -  2r0) •  (V  -  (N  -  I )r0) 

-^n('-T) 

Hence: 

In  J  =  N  In  V  +  *2  ln(l  - 

=  Mn  V-2t 

=  jV  In  K  -  ^  s  N\n(V  -  b) 

if  b  =  Ni'o/2  -  4  x  proper  volume  of  the  molecules.  Therefore 
J  =  (V  -  b)N  and  we  will  keep  this  form  even  when  V  is  not  large 
compared  to  b. 

Now,  to  take  into  account  the  attractive  forces,  one  can  say  that  if 
the  range  of  these  forces  is  sufficiently  long,  the  average  potential 
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energy  will  be  -CNal 2  V,  since  each  molecule  attracts  all  molecules 
in  its  action  sphere  and  the  number  of  these  molecules  will  be  pro¬ 
portional  to  the  number  density  N/V.  C  is  a  positive  constant  and 
the  contribution  to  the  partition  function  is  therefore  roughly 
exp(  +u/VkT)  with  u  =  CNa/2.  Altogether  one  gets  therefore: 

(b)  z  =  i 

and  from  p  =  kTi)(\n  Z)/dV  van  der  Waals  equation  follows. 

3.  This  was  first  pointed  out  by  J.  de  Boer  (Dissertation,  Amster¬ 
dam,  1940;  Physica  14  (1948)  139).  It  is  important  to  note  that  the 
mass  of  the  molecule  does  not  affect  the  equation  of  state  in  the 
classical  theory  to  which  we  restrict  ourselves.  In  the  quantum 
theory  this  is  no  longer  true.  The  equation  of  state,  using  the  same 
intermolecular  potential,  then  depends  on  a  third  dimensionless 
variable,  the  so-called  de  Boer  parameter: 

.  h 
o(/nc)1/a' 

See  also  the  dissertation  of  R.  J.  Lunbeck  (Amsterdam,  1951). 

4.  H.  D.  Ursell,  Proc.  Cambridge  Philos.  Soc.  23  (1927)  685.  The 
work  of  Mayer  is  summarized  in  the  book  of  J.  E.  Mayer  and  M.  G. 
Mayer,  Statistical  Mechanics  (Wiley,  New  York,  1940).  For  the 
generalization  to  the  quantum  theory  see  B.  Kahn  and  G.  E.  Uhlen- 
beck,  Physica  5  (1938)  399;  B.  Kahn  (Dissertation,  Utrecht,  1938). 
For  the  generalization  to  the  Bose  or  Fermi  statistics  see  T.  D.  Lee 
and  C.  N.  Yang,  Phys.  Rev.  113(1959)  1 165. 

5.  Most  of  the  terminology  is  the  same  as  in  the  book  of  Kdnig 
(Theorie  der  Endlichen  und  Unendlichen  Graphen,  Leipzig,  1936). 
There  is  no  special  term  for  a  graph  without  articulation  points,  and 
it  is  convenient  to  have  one.  The  word  star  and  the  corresponding 
notion  of  star  tree  were  introduced  in  our  paper  on  the  combinatorial 
problems  in  the  theory  of  graphs  (Proc.  Nat.  Acad.  Sci.  U.S.A.  42 
(1956)  122). 

6.  The  following  presentation  of  the  Mayer  theory  follows  a  review 
article  by  G.  W.  Ford  and  G.  E.  Uhlenbeck  entitled:  The  theory  of 
linear  graphs  with  applications  to  the  theory  of  the  virial  development 
of  the  properties  of  gases,  which  appears  in  Studies  in  statistical 
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mechanics  Vol.  I  (North  Holland  Publishing  Company,  Amsterdam, 
1962). 

7.  If  one  takes  in  Theorem  I  for  the  weight  W,  yk,  where  y  is  an 
arbitrary  variable  and  k  is  the  number  of  lines  in  the  graph  GN  or  C,, 
then  it  is  clear  that  W  fulfills  all  the  requirements,  and  that  therefore 
Theorem  I  is  applicable.  One  obtains: 


(C) 

where: 

1  +  N(x,  y)  =  eC(l  »> 

(d) 

N(*>y)=2^  2  ;w. 

p-l  V-  k-0 

(e) 

®  v-p  PIP- l)/2 

c(x,  y)  -  2,  2  c**yk 

P« 1 r-  k • p - l 

and: 

Nvk  -  number  of  labelled  (p,  k)  graphs  (=  graphs  of  p 
points  and  k  lines) 

(0 

Cpk  =  number  of  connected,  labelled  (p,  k)  graphs. 

Eq.  (c)  was  first  derived  by  R.  J.  Riddell  (Dissertation,  University  of 
Michigan,  1951).  It  allows  the  calculation  of  the  numbers  Cpk  in 
terms  of  the  known  binomial  coefficients  Npk. 

8.  For  a  complete  account  see  the  book  by  J.  O.  Hirschfelder, 
C.  F.  Curtiss  and  R.  B.  Bird,  The  Molecular  Theory  of  Gases  and 
Liquids  (Wiley,  New  York,  1954).  A  very  useful  summary  of  the 
theoretical  calculations  of  the  virial  coefficients  is  given  by  T.  Kihara, 
Revs.  Modern  Phys.  25  (1953)  831 ;  27  (1955)  412. 

9.  By  taking  as  in  Note  7  for  the  weight  W,  y *  ( k  =  number  of 
lines  in  the  graph)  one  sees  that  the  two  requirements  for  the  weight 
WiCp)  are  fulfilled  so  that  Theorem  II  is  applicable.  Writing: 

®  PIP  -  l>/2 

(g)  S(x,  y)  =  2  2  •W* 

p>ir  k-p-1 

where  Spk  is  the  number  of  stars  with  p  labelled  points  and  k  lines 
(with  the  convention  S2 1  =  1),  and  putting: 
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where  C(x,  y)  is  giveii  (e),  then  Theorem  II  yields  the  second  Riddell 
formula : 


(h) 


£s(r,y)  =  ln^- 


From  (h)  one  can  find  Spk  in  terms  of  the  Cpk.  It  is  not  difficult  to 
show  that  for  large  p  and  k  >  p  In  p  the  numbers  Npk>  Cpk  and  Spk 
are  asymptotically  equal,  so  that  one  can  say  that  for  large  p  and 
k  >  p  In  p  the  overwhelming  majority  of  the  graphs  are  stars. 

10.  For  a  time  the  convergence  question  seemed  (even  to  the 
physicist!)  of  great  interest  because  Mayer,  Kahn  and  Uhlenbeck  (see 
Note  4  for  the  literature)  had  speculated  and  had  tried  to  prove  that 
the  nearest  singularity  of  y(7~,  z)  on  the  positive  real  z-axis  (assuming 
that  there  x  and  dxldz  remain  finite)  would  determine  the  point  of 
condensation  of  the  gas.  This  was  strongly  suggested  by  the  case  of 
the  ideal  Bose  gas,  which  is  of  course  not  a  special  case  of  the  non¬ 
ideal  gas  theory  we  have  discussed  (since  the  deviations  of  the  ideal 
gas  laws  are  in  this  case  due  to  quantum  effects),  but  which  is  a  case 
for  which  the  corresponding  h,  can  be  evaluated  explicitly.  In  fact, 
already  in  1925  Einstein  derived  for  the  Bose  gas  the  equations: 


(i) 


1  -  I  V  *L 
7  ~  A3  A  I33' 


jl  -  rL  s  — 

kT  A3 ,4-,  P3 


which  have  precisely  the  same  form  as  the  Mayer  equations  if  one 
puts: 

A  A3(l  _1) 

*  =  X3’  h w  -  -757 5- 


Clearly  the  series: 

®  A1 

m  =  1 4 

1- 1  * 

is  convergent  up  to  A  =  1,  which  is  the  nearest  singularity  on  the 
positive  real  axis  of  the  analytic  function  generated  by  the  series.  The 
point  A  =  I  is  a  branchpoint  of  order  two  and  x  and  dx/dA  are  finite 
at  this  point.  It  is  known,  and  it  can  be  proved  in  various  ways,  that 
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at  this  point  a  phase  transition  occurs,  the  so-called  Bose-Einstein 
condensation.  For  values  of  v  smaller  than: 

A3  A3 

r'  ~  ±r6'2  ~  2.612.  • 

the  pressure  stays  strictly  constant  till  v  =  0.  Also  the  convergence 
of  the  virial  series  which  follows  from  (i)  by  eliminating  A  has  been 
investigated  and  it  has  been  shown  (W.  H.  J.  Fuchs,  J.  Rational 
Mech.  Anal.  4  (1955)  647)  that  the  virial  series  converges  beyond  the 
condensation  point.  In  fact  Fuchs  could  show  that  the  radius  of 
convergence  of  the  virial  series  (in  A 3/r)  lies  between  12.56  and  27.73. 

Unfortunately,  by  now  it  seems  very  likely  that  the  connection 
between  the  condensation  point  and  the  singularity  on  the  positive 
real  axis  of  the  function  \  for  the  ideal  Bose  gas  is  a  kind  of  remarkable 
fluke,  and  that  for  real  gases  there  is  no  such  a  direct  connection. 
We  come  back  to  this  point  in  Chapter  111. 

11.  To  our  knowledge  the  first  use  of  the  Gaussian  model  was  by 
E.  W.  Montroll,  T.  H.  Berlin,  and  R.  W.  Hart.  The  article  appears  in 
Chungementsde phases,  Societede  Chimie  Physique,  Paris,  1952,  p.  212. 

12.  The  first  theorem  is  due  to  Kirchhoff  ( Gesammehe  Abhandlungen , 
Leipzig,  J.  Barth,  1 882,  p.  22).  If  one  takes  for  Cp  the  complete  graph 
of  p  points,  which  has  the  complexity: 


then  this  must  therefore  be  the  number  of  Cayley  trees  with  p  labelled 
points,  which  is  a  well  known  result  first  derived  by  Cayley. 

The  second  theorem  is  due  to  G.  W.  Ford.  It  can  be  proved  in  the 
following  way.  From  the  Kirchhoff  theorem  follows  that  one  can 
write: 

2  «*■*>- 2  2  1 

<cp*>  (Vector, 

where  the  first  sum  is  over  all  Cayley  trees  of  p  points  and  the  second 
sum  is  over  those  (p,  k )  graphs  which  contain  a  given  Cayley  tree  Tp. 
Now  the  number  of  connected  graphs  with  p  labelled  points  and  k  lines 
which  contain  a  particular  Cayley  tree  is  just  the  number  of  ways 
(k  -  p  +  I)  lines  can  be  distributed  among  the  (1/2 )(p  -  l)(p  -  2) 
pairs  of  points  in  the  Cayley  tree  which  have  no  line.  Therefore: 
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and  since  there  are  pT  ~ 2  Cayley  trees  with  p  points,  Eq.  (49)  follows. 

13.  For  tables  see  the  article  referred  to  in  Note  6.  Note,  that  one 
can  formulate  the  problem  of  finding  the  distribution  function  rt(p,  k,  d) 
also  in  the  following  way.  Find  the  distribution  of  the  first  principal 
minor  of  random ,  symmetric  p  by  p  matrices  in  which  with  equal  a 
priori  probability  k  of  the  p(p  -  l)/2  off-diagonal  elements,  say  on 
the  right  of  the  main  diagonal,  are  set  equal  to  minus  one  and  the  rest 
equal  to  zero.  The  matrix  is  then  completed  by  taking  each  diagonal 
element  equal  to  the  sum  of  the  elements  on  the  corresponding  row 
(or  column)  with  the  opposite  sign. 


Chapter  III 


"i 


Remarks  on  the  Condensation  Problem 


1.  Introduction.  Although  the  phenomena  of  condensation  and  other 
phase  transitions  (like  melting)  are  so  very  familiar,  they  are,  when 
one  tries  to  think  about  them,  rather  mysterious.  How  do  the 
molecules  of  the  vapour  “know’*  that  at  a  definite  specific  volume 


Figure  1 

they  should  condense  and  form  two  phases?  And  why  is  the  transi¬ 
tion  so  sharp  ?  Clearly  there  are  two  arrangements  of  the  molecules 
with  “probabilities"  depending  on  v  which  switch  over  at  the  critical 
value  vlw  This  is  confirmed  by  the  phenomena  of  supersaturation  and 
supercooling,  represented  by  extensions  of  the  vapour  and  liquid  parts 
of  the  isotherm  and  in  which  the  vapour  and  liquid  are  in  metastable 
states  (see  Fig.  I).  For  van  der  Waals  these  phenomena  are  of  course 
no  mystery  and  in  fact  seem  to  confirm  his  views.  However,  once 
one  gives  up  the  van  der  Waals  equation  this  picture  makes  one  doubt 
whether  the  condensation  phenomenon  can  come  out  from  the  basic 
equation : 
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(1)  e  tr  =  Z  =  ^5*  jv  ‘  jvdr i  •  exp[-/S  ^ 
where  0  =  l//c7'. 

At  one  time  people  thought  that  perhaps  the  stable  isotherm  could 
not  be  derived  from  (1)  without  further  assumptions.  Perhaps  one 
should  make  a  separate  calculation  of  V  for  two  different  arrange¬ 
ments  of  the  molecules,  one  corresponding  to  a  homogeneous  phase 
and  one  corresponding  to  the  two  phase  system.  For  each  volume 
one  would  then  obtain  two  values  of  V  and  the  real  isotherm  would 
be  determined  by  the  lowest  value  of  'V.  Although  such  calculations 
for  phase  transitions  between  different  solid  phases  have  given  good 
results,  it  is  I  think  now  the  accepted  opinion  that  they  are  in  principle 
not  correct.  The  integral  (I)  should  always  give  the  most  probable 
state  of  the  system,  which  is  the  state  of  stable  equilibrium.  The 
stable  isotherm  should  therefore  follow  automatically  from  (1)  without 
further  assumptions. 

However,  from  the  mathematical  standpoint  it  is  hard  to  imagine 
how  the  harmless  Eq.  (I)  could  lead  to  a  'V  (and  therefore  to  a  p) 
with  such  sharp  discontinuities.  The  answer  to  this  lies  in  the  fact 
that  one  is  interested  in  a  limit  property  of  V.  The  problem  has  a 
physical  sense  only  when  the  system  is  very  large.  One  is  therefore 
led  to  the  investigation  of  the  limit  N  -*  go,  V  oo,  v  =  VfN  finite. 
One  can  then  prove  that  the  limit: 

(2)  T)  =  Lim  l  nNv,  T,  N) 

always  exists.  It  is  for  this  function  i/4.v,  T)  that  one  should  expect 
the  sharp  discontinuities,  and  although  perhaps  still  surprising  (and 
certainly  not  proved)  it  is  clearly  mathematically  conceivable. 

Finally,  how  about  the  supersaturation  phenomenon?  If  (1)  gives 
the  stable  isotherm,  then  it  can  not  explain  supersaturation.  To 
explain  this  strictly  from  the  partition  function  one  must  think  of 
situations  in  which  the  supersaturated  vapour  is  in  an  equilibrium 
state.  This  is  the  case  in  capillary  vessels,  and  for  the  strict  explana¬ 
tion  of  supersaturation  a  detailed  study  of  the  effect  of  the  shape  of 
the  vessel  would  therefore  be  necessary.  It  has  never  been  really 
attempted. 

2.  The  Van  Hove  theorem.1  I  said  already  that  one  can  show  that 
the  limit  (2)  exists.  This  was  proved  rigorously  first  by  L.  Van  Hove 
who  could  in  addition  show  that  p  =  -  r^/r» v  is  a  monotonic  decreas¬ 
ing  function  (or  better  a  never  increasing  function)  of  v.  This  con- 
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firms  therefore  that  one  never  can  get  *  van  der  Waals  like  isotherm 
from  (I),  but  of  course  it  does  not  show  that  there  must  be  a  hori¬ 
zontal  portion  in  the  isotherm. 

A  simplified  proof  of  these  results  which  probably  can  be  made  rigor* 
ous  goes  as  follows. 


Assume  that  <f>(r)  has  a  hard  core  r0,  and  that  for  r  >  r0  the  force  is 
attractive,  range  r,  and  with  a  lower  bound  for  the  potential  equal  to 
-r  (see  Fig.  2).  I  am  sure  that  these  special  assumptions  can  be 
relaxed,  but  let  me  stick  to  them.  Now  divide  the  (cubical)  volume 
V  =  V  in  cubical  cells  each  of  volume  y  -  <P  where  L  >  d 
The  idea  of  the  proof  consists  of  comparing  the  integral  (1)  with  the 
value  it  would  have  if  we  eliminate  all  the  interactions  between  the 
particles  in  different  cells.  Write  in  this  case  for  the  integral 

(3)  t  =  j—  Jv  •  jvc/Tr  -drNe  =  exp[-/?V'(F,  T,  W)]. 

is  the  new  total  potential  energy,  and  call  ♦  =  li  <  t  <Kr„)  the 
original  potential  energy.  Since  with  our  assumptions  the  interaction 
between  cells  is  attractive  >  <P.  On  the  other  hand  since  each  cell 
is  surrounded  by  a  layer  of  volume  (x/2/-,  in  which  there  can  be  external 
molecules  with  which  it  can  interact,  and  since  in  this  layer  there  can 
be  at  most  d2rxjro  molecules  (omitting  all  numerical  factors),  and 
since  each  of  these  molecules  can  interact  at  most  with  r\lrl  other 
molecules,  one  must  have  (since  there  are  Vjdz  cells): 
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<t>  £ 


d1rl  rj  V 
ii  r%  d* 


E  +  $. 


Hence  one  can  conclude: 

e-Ki  <  e-e*  g  exp[-/3«J  +  ppV/d] 


with  p  =  r\e/rl.  The  integration  over  tx-  ■  -  rN  does  not  change  the 
direction  of  the  inequalities  and  by  calculating  the  free  energies  *F 
and  one  obtains: 


(4) 


l'f'>iv,>  —  *?'  +  — 

N  ~  N  m  N  d 


where  v  =  VI N.  Since  increasing  N,  keeping  v  fixed,  one  still  can 
let  d  -*■  qo  say  as  Nl  8,  one  sees  that  V/N  and  V/N  will  approach  to 
the  same  limit  if  such  a  limit  exists.  Now 


(5) 


2-StS«»S;  „,v*'  fj  X  X*.  <  “Pi  - Wr,  •  ■  -r.)) 


=  S  exPl-^  2 

<»,)  <-l 

where  the  sum  is  over  all  numbers  /»,  of  particles  in  the  ith  cell,  with 
the  condition  (indicated  by  the  prime): 

2  ».  =  N, 

1 

and  where  we  have  put : 


(6)  e~tn> 


=  ^!^/r  '"j/1"  exP[“Wr»---r01- 


Many  of  the  numbers  n,  may  still  be  equal  to  each  other.  Let  ak  be 
the  number  of  cells  in  which  the  number  of  molecules  is  k,  then 


(7) 


2  = 


S*  rnh  exPi~0  2  ka«^  y)i 

<ak)  1  la*‘  k 


where  the  double  prime  indicates  the  conditions: 


(8) 

Calling: 


2a*  = 
k 


2  kak  =  N. 


gk  =  e 


fktik.r) 


one  sees  that  (7)  is  of  the  multinomial  form  and  one  knows  that  for  v 
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and  N  large  it  is  closely  approximated  by  the  maximum  term  in  the 
sum,  for  which: 

(9)  ak  =  Agkzk 

where  A  and  z  must  be  determined  from  (8).  In  fact: 

(10)  In  2  2  -  N  In  z  +  v  In  a (z) 
with: 


From  (8)  and  (9): 


<*z)  =  2  S*2*- 

k 


v  =  Ao{z ), 


N-A,± 

dz 


so  that  since  v  =  V/d3  =  Nr/d3,  one  has: 

(11)  —  =  — - 

1  '  d3  z(da/dz) 

For  fixed  d,  z  depends  therefore  only  on  r  and  from  (9)  it  then  follows 
(always  for  fixed  d)  that : 

(12)  ^ln2  s  -  Inr  +  j3\na{z)  - 


Together  with  the  inequality  (4),  this  shows  that  the  limit  (2)  exists. 
Furthermore  from  the  inequality  (4)  it  follows  that  also  p  =  -d^/dv 
and  p  =  r^/r'r  differ  at  most  by  a  constant  quantity  proportional  to 
I  Id,  and  which  can  therefore  be  made  as  small  as  one  wishes.  So 
it  is  sufficient  to  look  at  p.  From  (12)  follows: 


(13) 


P  =  frp  In  <Az), 

tp _ l_  </ln  <t(2)  dz 

inr  fid3  dz  dv 


and  from  (11)  one  gets: 

dr  d3  T  2/</a\2  d  / 

dz  ~  zHda/dz)2  lZ  \dz)  ~  aZ  Tz  \Z  Jzj \ 

<l4)  -  -  (?**-•*)(!  *1**2*)] 

■  -  s  °- 
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Since  all  gk  >  0,  one  sees  from  ( 1 3)  that  the  sign  of  dpfdv  is  determined 
by  dz/dv,  and  from  (14)  one  can  then  conclude  that  dp/dv  and  therefore 
dp/dv  <  0,  which  is  the  content  of  the  second  part  of  the  Van  Hove 
theorem.3 

3.  The  ideas  of  Yang  and  Lee.  In  a  very  interesting  paper3  Yang  and 
Lee  have  shown  a  way  how  the  condensation  phenomenon  could 
follow  mathematically  from  the  basic  equation  (1)  in  the  limit 
jV,  V  -*  oo.  v  =  V/N  fixed.  Their  starting  point  is  the  grand  canonical 
partition  function: 

Ztr(  V,  T,  z)  =  J  (WKJ,  N) 

N  -  0 

for  which  we  derived  in  Chapter  II  the  expression: 

(15)  Zgr(  V .  r.  z)  =  exp{Vx(Y<  T.  2)} 

which  is  valid  for  any  finite  V.  With  the  assumption  of  a  hard  core, 
radius  r0,  we  already  noted  that  Zgr  must  be  a  polynomial  of  degree 
M  £  V/rl.  Let  the  zeros  of  this  polynomial  be  zt,  then  (dropping 
from  now  on  the  T,  which  remains  constant  anyway): 

(16)  Z„(Kz)=l  1(1-7) 
and 

,17,  *K.-,-i|l„(l-£). 

One  can  therefore  interpret  *(  V ,  r)  as  the  complex  logarithmic  potential 
of  M  point  charges  of  strength  ]/V  situated  at  the  points  :t.  Note 
that  since  the  coefficients  of  the  polynomial  Zgr(  V,  z)  are  positive, 
none  of  the  zeros  can  be  on  the  positive  real  axis,  so  that  x(K  z), 
for  fixed  V,  will  be  an  analytic  function  of  z  along  the  positive  real 
axis. 

Let  us  now  look  at  the  limit  V  *  a  .  For  increasing  V  the  strength 
of  the  charges  become  smaller  and  smaller,  while  the  number  M 
increases.  Suppose  now  that  in  the  limit  V -*  00  a  number  of  the 
point  charges  concentrate  themselves  in  a  single  layer  AB  which  crosses 
the  positive  real  axis.  Of  course  in  addition  there  may  be  other  layers 
like  CD  and  finite  poles  Pu  P2,  ■  ■  ■  (see  Fig.  3).  The  function: 

x(z)  =  Lim  x(  V,  z) 

V-oo 


(18) 
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will  in  this  case  be  completely  defined  along  the  positive  real  axis,  but 
it  will  consist  of  two  analytic  pieces,  one  inside  the  arc  AB  of  zero’s 
and  one  outside.  From  the  theory  of  single  charge  layers  one  knows 
that  at  the  crossing  point  zc  these  two  pieces  will  be  continuous  but 
the  first  derivative  will  have  a  discontinuity. 

Clearly  for  any  finite  but  large  V,  the  argument  given  in  Chapter  II 
to  obtain  from  (15)  the  free  energy  and  hence  the  pressure  is  still  valid, 
so  that  the  isotherm  is  determined  by  the  generalized  Mayer  equations: 


(19) 


g*(*o) 

dz0 


involving  now  the  limit  function  rfz).  Because  of  the  discontinuity 
in  the  first  derivative  of  x  along  the  positive  real  axis,  the  functions 
z  dyldz  and  x(z)  will  look  about  as  shown  in  Figs.  4  and  5.  The 
isotherm  which  follows  by  eliminating  the  saddle  point  will  therefore 
have  the  desired  form  (Fig.  6).  Let  me  conclude  with  the  following 
remarks : 

(a)  The  curves  z  dx/dz  and  x(z)  are  the  limit  curves  of  a  series  of 
functions  z  tyi  K  z)/dz  and  x(  K  *)  which  are  analytic  along  the  real 
axis.  How  one  of  these  functions  might  approximate  the  limit  curve 
is  indicated  by  the  dotted  lines.  It  shows  again  that  the  condensation 
phenomenon  is  a  limit  property. 

(b)  Of  course  these  considerations  of  Yang  and  Lee  are  not  a 
complete  theory  of  the  condensation  phenomenon.  They  show  only 


68  REMARKS  ON  THE  CONDENSATION  PROBLEM 


Vg  V,  V 


Figure  6 

a  possible  (and  I  think  very  likely)  mathematical  “mechanism”,  and 
they  show  how  subtle  and  difficult  the  real  theory  will  be. 

(c)  If  the  Yang-Lee  picture  is  correct  then  the  Mayer  limit  function 
x(z)  discussed  in  Chapter  II  will  have  a  non-zero  radius  of  convergence 
and  it  will  be  identical  to  the  inner  pan  of  *(z )  for  a  region  around  the 
origin.  But  this  x(z)  or  even  its  analytic  continuation  will  not  deter¬ 
mine  the  condensation  point,  since  the  outer  part  of  *(r)  is  not  deter¬ 
mined  by  the  inner  part  of  x(z). 

(d)  The  role  which  the  hard  core  assumption  for  the  intermolecular 
force  plays  in  the  argument  may  seem  quite  striking,  and  one  might 
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even  think  that  therefore  a  gas  of  hard  spheres  will  also  show  the 
condensation  phenomenon.  This  is  of  course  not  necessarily  so. 
The  charges  may  concentrate  in  single  layers,  but  if  for  V  -*■  oo  the 
charges  do  not  close  down  on  the  real  axis,  that  is  to  say  if  there  is 
always  a  gap,  then  there  will  be  no  condensation  since  *(*)  will  then 
be  analytic  along  the  whole  real  axis. 

4.  Some  examples.4  To  make  things  more  concrete,  we  consider 
here  a  few  artificial  examples  constructed  to  illustrate  the  analytic 
features  of  the  theory.  Unfortunately,  no  one  has  ever  been  able  to 
compute  the  partition  function  for  a  realistic  model  of  a  gas,  so  our 
examples  must  be  quite  ad  hoc. 

Suppose  that  the  grand  canonical  ensemble  partition  function  has 
the  form: 

(20)  Z„(V,z)  «  (I  +  if 

where  we  take  V  to  be  an  integer.  Here  V is,  say,  the  volume  measured 
in  units  rjj  and  is  analogous  to  the  M  in  Eq.  (16).  The  canonical 
ensemble  partition  function  corresponding  to  (20)  is  obtained  by 
expanding  in  powers  of  z  and  identifying  Z(  V,  N )  as  the  coefficient  of 
zN.  We  find: 

,2i)  z<KA,)-,„i(*-  %)• 


This  model  has  been  constructed  so  that  it  has  the  general  features 
which  the  partition  function  of  a  real  gas  must  have,  i.e.  the  function 
Z,r(  K  z)  is  a  polynomial  of  degree  2  V  with  positive  coefficients  and 
the  coefficients  are  monotonically  increasing  functions  of  V  for  fixed 
N.  In  our  example  (20)  the  2V zeros  of  Zfr  are  all  on  the  unit  circle, 
half  of  them  at  z  =  -  1 ,  the  other  half  at  the  Vth  roots  of  unity  other 
than  z  -  - 1 .  There  are  no  real  positive  zeros.  It  is  now  easy  to 
show  that  for  this  model  the  limit  function  *(z)  is  given  by: 


(22) 


*(z) 


In  (I  +  z),  for  z  <  1, 

In  (I  +  z)  +  In  z,  for  z  >  1. 


This  is  just  the  logarithmic  potential  due  to  one  unit  of  charge  at 
z  =  - 1  and  one  unit  of  charge  distributed  in  a  uniform  single  layer 
on  the  unit  circle.  The  function  *(z)  is  continuous  along  the  positive 
real  axis  but  its  derivative  has  a  discontinuity  at  z  =  1.  Note  that 
x(z)  outside  the  unit  circle  is  not  the  analytic  continuation  of  *(r) 
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inside  the  circle.  Also  note  that  the  function  x(z)  introduced  in  §6, 
Ch.  II  is  just: 

(23)  \(z)  =  In  (I  +  z). 


This  function  has  no  singularities  along  the  positive  real  axis,  indeed 
its  only  singularities  are  logarithmic  branchpoints  at  z  =  - 1  and 
z  =  oo. 

The  equation  of  state  corresponding  to  (22)  is  readily  obtained 
using  the  Mayer  equations  (19).  We  find 


(24) 


v  +  2t3  +  3i>3  + 


P_ 

kT 


(In  2 


In 


id  -  «  ) 
(2v  -  l)2 


v  >  2, 


2  > 

2  1 
3>v>  2 


This  is  the  typical  form  of  the  isotherm  for  a  condensing  gas.  Note 
that  the  viriat  expansion  of  the  equation  of  state  is  convergent  for 
v  >  I  while  the  gas  condenses  at  v  =  2.  Hence,  an  important  thing 
we  learn  from  this  model  is  that  we  cannot  in  general  determine  the 
point  of  condensation  or  the  properties  of  the  condensed  phase  from 
a  study  of  the  equation  of  state  in  the  gas  phase  alone,  i.e.  from  the 
virial  series. 

Note  that  one  can  obtain  the  equation  of  state  also  directly  by 
computing  the  limit: 


using  (21)  for  Z(V,  N).  One  does  not  need  to  introduce  the  grand 
canonical  partition  function,  although  of  course  it  is  quite  a  good 
trick! 

It  may  still  be  of  interest  to  consider  a  wider  class  of  models,  namely 
those  in  which  the  zeros  of  Zgr  all  lie  on  the  unit  circle.  If  we  intro¬ 
duce  g(8)  such  that  Vg(0)  dO  is  the  number  of  zeros  of  Zwr  which  lie 
between  0  and  6  +  dO  on  the  unit  circle,  then  the  limit  function  x(z) 
will  be  the  logarithmic  potential  due  to  a  charge  distribution  with 
density  f>(8)  on  the  unit  circle,  so  that: 

(25)  x(-)  =  f  + "  dO g(0)  In  (1  —  ze~in). 


Since  Zrr  is  a  real  function  its  zeros  must  be  distributed  symmetrically 
with  respect  to  the  real  axis,  i.e. 
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(26)  gW~g(-0), 

We  can  find  further  properties  of  g(0)  by  expanding  both  sides  of  (25) 
in  powers  of  z  for  |z|  <  1.  Since  for  |z|  <  1 : 

(27)  m  =  x(z)  =  f  b,z ', 

J-  1 

one  finds: 

(28)  bt  =  -  ^  j*”  d0g(0)cosl0. 

Thus,  the  b,' s  are  directly  related  to  the  coefficients  in  the  Fourier 
cosine  series  expansion  of  g(0).  Hence  we  can  write: 

(29)  g(0)  =  I*,  --fib,  cos  10. 

i-n  it  ,fa 

Here, 

(30)  g0  =  j* *  d0g(0) 

is  the  total  charge  in  the  distribution.  From  (29)  we  see  explicitly 
that  g(0)  cannot  be  completely  determined  from  a  knowledge  of  the 
b,' s,  i.e.  from  x(z);  there  is  still  the  undetermined  constant  g0.  From 
(25)  one  can  readily  show  that  in  general: 

(31)  x(‘)  =  x(^)  +  go  In  z. 

Hence,  we  can  write 

|x(z),  !*l  <  1. 

(32)  *(*)H_/1\^  ,  ,,  . 

|x^~  j  +  go  In  z,  |z|  >  1. 

Here  we  see  explicitly  that  x  in  the  outer  region  cannot  in  general  be 
determined  from  x  in  the  inner  region. 


Notes  on  Chapter  III 


1.  L.  Van  Hove,  Physica  15  (1949)  951.  We  follow  the  presentation 
given  by  R.  J.  Riddell  in  his  dissertation  ( Contributions  to  the  Theory 
of  Condensation,  University  of  Michigan,  1951). 

2.  This  proof  should  be  compared  with  the  proof  given  by  C.  N. 
Yang  and  T.  D.  Lee,  Phys.  Rev.  87  (1952)  404,  Appendix  I.  With 
the  same  assumptions  for  the  intermolecular  potential  and  using 
essentially  the  same  method  as  Van  Hove,  Yang  and  Lee  prove  that 
the  limit: 

(a)  x(T,z)  =  Lim  pin  Z,r(  V,  T,  z) 

v- «  V 


exists.  In  addition  they  prove  that  both  x(z)  and  ;  c>xle)z  for  positive 
real  r  are  monotonically  increasing  with  increasing  z  and  that  x(z) 
must  be  continuous. 

Since  the  grand  partition  function  Z(r  is  a  polynomial  with  positive 
coefficients  from  which  for  large  N  the  partition  function  Z(Nv,  T,  N) 
follows  by  the  steepest  descent  method  (see  Chapter  II,  §7)  it  is  clear 
that  from  the  existence  of  the  limit  (a)  and  from  the  properties  of  x. 
and  z  <fx/(’z  both  parts  of  the  Van  Hove  theorem  follow.  The  question 
arises  whether  the  inverse  of  this  statement  is  also  true.  Can  one 
deduce  from  the  Van  Hove  theorem  the  existence  of  the  limit  (a)  and 
the  properties  of  *(r).  This  is  of  course  very  likely  but  a  formal 
proof  (which  we  owe  to  M.  Kac)  is  perhaps  not  superfluous. 

Let  us  assume  that  with  increasing  volume  the  vessel  always  keeps 
the  same  shape.  One  then  can  introduce  a  Laplace  transform  of 
Z„  by: 


(b) 


dVe-'vZMT(V,Tz) 


y  zN  r  dVe  ,vZ(V,  T,  N ) 

N  *  0  JO 


=  -  +  V  NzN  r  dve-NnZ(Nv,N) 
5  nT  1  Jo 


putting  V  =  Nv  and  omitting  from  now  on  the  variable  T.  Since  for 
fixed  i V,  Z{Nv,  N)  ~  vN  for  large  v  (ideal  gas  limit)  the  integral  in  (b) 
for  s  >  0  always  converges,  and  one  can  conclude  that  the  abscissa  of 
convergence  of  the  Laplace  integral  is  determined  by  the  convergence 
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of  the  series  over  N.  According  to  the  first  part  of  the  Van  Hove 
theorem  Z(Nv,  N)  s  exp(  -NPMv))  if  N  is  large.  Using  the  method 
of  steepest  descent  the  integral  in  (b)  can  therefore  for  large  N  be 
approximated  by: 

Cm  /  \lia 

(c)  dv  C-N,,v  *e*(v U  ~  (  -  I  e-NI»»0  +  <*(»o>l 

Jo  ~  \Nif(v o)/ 

where  the  saddle  point  v0  is  determined  by 


(d) 


s  +  /? 


&Kvq) 

bv0 


=  0. 


Here  we  also  have  used  the  fact  that  for  v  >  0  b^/dv  <  0  and  </>'  s 
b2ifijbv3  >  0  which  follows  from  the  second  part  of  the  Van  Hove 
theorem  and  from  the  fact  that  the  pressure  p  =  -bi/i/dv  is  always 
positive,  since  it  is  positive  in  the  ideal  gas  limit. 

From  (b)  and  (c)  one  concludes  that  the  abscissa  of  convergence 
of  the  Laplace  integral  is  that  positive  value  of  s  =  \  for  which : 

(e)  2*-lIr0*»(»o>I  =  1 

where  v0  is  determined  from(d)  which  we  now  write  as: 


(f) 


x  = 


O<*)ko) 


Eqs.  (e)  and  (f )  are  two  simultaneous  equations  from  which  x  and  v0 
are  to  be  determined  as  functions  of  z.  We  shall  show  from  the 
Van  Hove  theorem  that  there  is  always  an  unique  solution  for  x,  which 
implies  that  for  large  V : 

Z,r (F,z)  s  evitz) 


so  that  the  Yang-Lee  limit  (a)  exists.  The  properties  of  x(z)  also 
follow  from  (e)  and  (f).  To  see  all  this,  write  Eq.  (e)  in  the  form: 


(g) 


<Hv0)  -  Vo  1  In  2 

bv0  p 


using  (f).  This  is  now  an  equation  for  v0  as  function  of  z,  and  then  (f ) 
determines  x-  Eq-  (g)  can  most  easily  be  discussed  graphically. 
Suppose  first  that  no  condensation  occurs.  Then  according  to  Van 
Hove  ip(v)  is  a  monotonic  decreasing  and  upward  concave  function 
of  v.  For  any  value  of  z,  v0  is  then  uniquely  determined  by  drawing 
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the  tangent  from  the  point  (In  z)/l 3  on  the  0-axis  (see  Fig.  a)  and  the 
slope  of  the  tangent  determines  x  according  to  (f). 

Next  suppose  that  there  is  a  condensation  region,  which  of  course 
the  Van  Hove  theorem  does  not  exclude.  Then  the  curve  will 
have  a  straight  part  with  a  slope  corresponding  to  the  saturated 
vapour  pressure  (see  Fig.  b).  Clearly  in  this  case  the  tangent  con¬ 
struction  gives  for  z  <  zc  a  value  for  v0  >  ra,  for  z  >  zc  a  value  for 
r o  <  i’i,  while  for  z  =  zc  v0  is  not  uniquely  determined.  However, 


Figure  a 

the  slope  of  the  tangent  and  therefore  x  is  for  any  z  uniquely  fixed, 
and  one  easily  sees  that  x  is  a  continuous  and  monotonically  increasing 
function  of  z.  Finally,  since  by  differentiation  after  z,  one  deduces 
from  (f)  and  (g)  that: 


I  r»v 
—  =  z  T75 

I'o 

and  since  r0  is  a  monotonically  decreasing  function  of  z  which  has  a 
discontinuity  in  the  case  condensation  occurs,  it  is  clear  that  x(z)  has 
all  the  Yang-Lee  properties. 

It  should  be  emphasized  that  in  this  equivalence  proof  both  parts 
of  the  Van  Hove  theorem  have  been  used.  The  existence  of  the  limit: 
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(i)  -m>)  -  Limi|nZ(M.,7V) 

N-* D  /V 

is  no/  enough.  For  instance  for  the  van  der  Waals’  expression  for  Z: 

(j)  Z  =  NTtf*  (y  -  Nb)N  cxp[^  7V(7V  -  I)] 


the  limit  (i)  exists,  but 
00  p 


_  kT  _  j* 
dp  r  —  b  ra 


is  not  monotonic.  Constructing  from  (j)  the  grand  canonical  partition 
function,  it  is  then  not  difficult  to  show  that  this  leads  to  the  van  der 
Waals  equation  (k)  plus  the  Maxwell  rule  in  the  condensation  region. 

3.  C.  N.  Yang  and  T.  D.  Lee,  Statistical  Theory  of  Equations  of 
State  and  Phase  Transitions.  I.  Theory  of  Condensation,  Phys.  Rev. 
87  (1952)  404.  In  the  following  paper:  Lattice  Gas  and  Ising  Model, 
Phys.  Rev.  87  (1952)  410  the  same  authors  apply  their  general  theory 
to  a  so-called  lattice  gas,  in  which  one  assumes  that  the  molecules 
of  the  gas  are  restrained  to  lie  on  the  lattice  points  of  a  cubical  lattice. 
Each  lattice  point  is  either  vacant  or  occupied  but  no  two  molecules 
can  be  on  the  same  site.  Assuming  in  addition  that  only  nearest 
neighbours  attract  each  other,  the  authors  show  that  the  zeros  of  the 
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grand  partition  function  ail  lie  on  a  circle,  and  that  in  two  dimensions 
condensation  occurs. 

4.  The  first  example  (Eq.  (20))  is  due  to  G.  W.  Ford.  Lee  and 
Yang  (l.c.  Note  3,  p.  41 5)  introduced  the  density  distribution  g(9) 
of  the  zeros  on  the  unit  circle  which  leads  to  Eq.  (23)  and  also  Eq.  (28) 
for  the  b,  is  due  to  them. 


Chapter  IV 


The  Boltzmann  Equation 


1.  Introduction.  We  now  will  begin  with  the  discussion  of  the  non¬ 
equilibrium  properties  of  gases,  that  is  we  return  to  the  basic  problem 
of  Boltzmann:  why  and  how  is  the  equilibrium  state  reached  in  time. 
Following  the  historical  development,  I  will  start  with  the  theory  of 
dilute  gases  according  to  the  so-called  kinetic  method  of  Boltzmann, 
and  will  discuss  later  how  this  may  fit  into  the  general  Boltzmann- 
Gibbs  picture  of  the  approach  to  equilibrium  which  was  sketched  in 
the  first  chapter. 

The  basic  equation  of  the  kinetic  method  is  the  famous  Boltzmann 
equation:1 

(1)  1=  - +  J*i  JW(*  «/'/'.  -#.) 

which  expresses  how  the  number  of  molecules 

/(r,  v,  f )drdr 

in  the  element  dr  dr  of  the  six  dimensional  phase  space  (p-space)  of  a 
molecule  changes  in  time  due  to  the  streaming  in  ft-space  and  due  to 
the  collisions  with  other  molecules.  In  (1)  a  is  the  acceleration  due 
to  an  outside  potential  U(r)  and  the  streaming  part  of  (1): 

<2)  «/>■ 

is  clearly  just  the  local  change  of  /  per  second  due  to  the  independent 
motion  of  the  molecules  in  fi- space.  One  can  also  say,  that  5(/>  is 
the  one  particle  Liouville  operation  {//,/}  with  H  -  ( p3/2m )  +  U(r). 
The  last  term  in  (I)  describes  the  effect  of  the  collisions;  here  the  prime 
and  the  index  1  of  the /’ s  refer  to  the  velocity  variable  only,  so  that  for 
instance  fx  =  /( r,  v,,  t)  and  the  four  velocity  variables  refer  to  the 
velocities  of  the  binary  collision  (v,  v,)  (v',  v',);  g  =  |v  -  = 

|v',  -  v'|  is  the  relative  velocity  which  in  a  collision  turns  over  the 
angle  0.  To  find  the  number  per  c.c.  and  per  second  of  the  direct 
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collisions  (v,  v,)  -*(v',  y',),  Boltzmann  argues  as  follows  (see  Fig.  1). 
Consider  the  relative  motion  and  surround  all  molecules  with  velocity 
v  with  an  action  sphere  with  a  radius  equal  to  the  range  of  the  forces. 
All  molecules  with  velocity  between  v,  and  vt  +  d\ ,,  which  enter  the 
action  sphere  with  a  collision  parameter  between  p  and  p  +  dp  and 
which  lie  in  the  little  cylinder  gp  dp  de  dt  (e  is  a  polar  angle)  will  in  the 


Figure  I 


next  time  element  dt  collide  with  a  molecule  of  velocity  v  and  in  this 
collision  g  will  turn  over  an  angle  0.  The  relation  between  p,  g  and  6 
is  determined  by  the  dynamics  of  the  collision;  in  fact: 


where  *?0  is  the  smallest  root  of  the  expression  under  the  radical  sign. 
Now,  since  there  are / dv  molecules  with  velocity  v  per  c.c.,  and  since 
each  molecule  with  this  velocity  carries  a  collision  cylinder,  Boltzmann 
puts  for  the  number  of  direct  collisions  per  c.c.  and  per  second: 


(4) 


dA  -  gp  dp  de  ffx  dy  </v, 

=  9)  dSiffi  dn  dyx 
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where  we  have  written : 

p  dp  de  =  l(g,  0)  dSi. 

Kg,  #)  is  called  the  differential  collision  cross-section  for  a  collision 
into  the  solid  angle  dSi  =  sin  6  dO  de.  Clearly  Kg,  0)  through  (3)  is 
uniquely  determined  by  the  force  law  <f>(r). 

Similarly  Boltzmann  puts  for  the  number  of  restituting  collisions 
(»',  vi)  -*•  (v,,  va)  per  c.c.  and  per  second: 

(5)  dB  =  gl(g,  0)  dQf'fi  dv'  dv[. 

Since  the  transformation  from  v,  v,  to  v',  v,'  as  determined  by  the 
conservation  laws  of  energy  and  momentum,  is  clearly  a  linear  ortho¬ 
gonal  transformation,  (hdvx  =  dv'dv(,  and  from  (4)  and  (5)  one 
obtains  therefore  the  collision  term  in  the  Boltzmann  equation  (1) 
since  each  direct  collision  is  a  loss  and  each  restituting  collision  a  gain 
in  the  number  of  molecules  / ch  under  consideration. 

Eqs.  (4)  and  (5)  express  the  so-called  Stosszahl-Ansatz,  which  may 
seem  intuitively  quite  evident,  but  which  is  of  course  an  assumption 
of  a  non-mechanical  nature.  1  will  postpone  the  critical  discussion 
of  this  assumption  to  Chapter  VII.  For  the  time  being  let  us  accept 
that  the  Boltzmann  equation  describes  how  the  distribution  function 
/( r,  v,  t )  of  the  molecules  in  n-space  changes  in  time. 

2.  The  approach  to  equilibrium.  Since  (I)  is  of  the  first  order  in  the 
time,  a  basic  problem  is  the  initial  value  problem:  given  the  distribu¬ 
tion  function  /( r,  v,  0)  at  t  -  0,  find  the  distribution  at  any  later  time. 
It  was  a  great  triumph  for  Boltzmann  that  he  was  able  to  show  that 
any  initial  distribution  approaches  in  the  course  of  time  the  Maxwell- 
Boltzmann  distribution: 

(6)  /o  =  A0  exp[-/3|^  +  l/(r)|] 

for  sufficiently  general  outside  potentials  U( r).  In  here  A0  and  /3  are 
constants  which  are  determined  by  the  given  total  number  and  total 
energy  of  the  molecules. 

Since  Boltzmann's  arguments  are  so  well  known,  I  will  only  sketch 
his  proof. 

The  first  step  is  the  famous  H-theorem.  Let 

H(t)  =  J  jf\ogfdrdv. 


(7) 
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then: 

ffflog/**-®  +ffl 

<//  J  J  \  dt  f  streaming  \  ^  /collisions 


using  (I)  for  df/dt.  Due  to  the  streaming  the  H  function  does  not 
change.  This  follows  from  the  Liouville  theorem.  As  mentioned 
already  in  Chapter  I,  §2,  the  integral  over  the  whole  phase  space  of 
any  function  F(p)  of  the  density  of  the  ensemble  fluid  will  be  indepen¬ 
dent  of  the  time.  Since  with  independent  molecules  the  gas  is  an 
ensemble  of  which  the  density  is  /,  one  can  conclude  that  the  integral 
of  /log /over  the  phase  space  ( =  M-space)  will  not  change  with  time 
because  of  the  streaming.  Due  to  the  collisions  H  always  decreases 
since  by  a  few  simple  and  classical  transformations  one  can  write:3 

(f  L„  -  3  /*/*'  5  »• 

dH/dt  is  zero  only  if  for  all  possible  collisions 

(8)  #.  =  f'fi 

and  this  state  will  be  reached  monotonically  in  time.  Eq.  (8)  deter¬ 
mines  the  dependence  of  the  equilibrium  distribution  on  the  velocities. 
In  fact  the  only  solution  of  the  functional  equation  (8)  is: 

(9)  /=  /fexp{-^(v  -  u)3j 

where  A,  p  and  the  average  velocity  u  can  still  be  functions  of  r  and  t. 
From  the  Boltzmann  equation  follows  that  an  /  of  the  form  (9)  must 
still  fulfill  the  equation: 


(10) 


tf  er  'of 

ir.  +  v  ~L  +  *  "sr  =  0 

dt  Or  dv 


and  one  can  show  that  for  a  sufficiently  general  outside  potential 
l/(r)  (which  include  the  shape  of  the  vessel  or  of  the  wall  potential), 
the  only  solution  of  (10)  which  is  of  the  form  (9)  is  found  by  putting 
in  (9)  u  =  0,  p  equal  to  a  constant  independent  of  r  and  t,  and: 

(11)  A  =  A0e~n',r> 

where  A0  is  a  constant. 

The  proof  is  given  in  Note  3.  Let  me  only  say  that  for  special 
outside  potentials  for  instance  a  harmonic  potential  (/( r)  =  u*r3/2,  the 
spatial  equilibrium  distribution  (II)  will  not  be  reached  in  time.  For 
such  special  potentials  there  are  a  host  of  special  solutions  of  the 
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Boltzmann  equation,  in  which  the  dependence  on  the  velocity  always 
has  the  form  (9)  but  where  the  A,  p  and  a  can  be  functions  of  space 
and  time.  Boltzmann  himself  gave  a  detailed  discussion  of  these 
solutions  (see  Boltzmann,  Wissenschaftiiche  Abhandlungen,  Vol.  II, 
p.  83).  They  have  however  only  a  limited  interest. 

Let  me  conclude  with  a  number  of  remarks. 

(a)  Clearly  Boltzmann  does  not  actually  solve  the  initial  value 
problem  stated  in  the  beginning  of  this  section.  In  fact  he  supposes 
in  the  first  place  that  the  initial  value  problem  has  an  unique  solution 
and  then  that  if  at  t  =  0  the  distribution  function  /  goes  to  zero  for 
r,  v  -»  qo  sufficiently  fast  for  the  convergence  of  all  the  required 
integrals  (especially  for  the  total  number,  for  the  total  energy  and  for 
H),  then  also  for  any  later  time  these  integrals  remain  convergent. 
From  the  mathematical  point  of  view  the  proof  of  Boltzmann  is  there¬ 
fore  not  irreproachable.  For  the  spatially  uniform  case  (U  s  0; 
/  only  function  of  v  and  /)  and  for  elastic  spheres  (/(#,  6)  =  constant) 
Carleman  has  given  a  rigorous  proof.4  I  will  not  try  to  indicate  the 
method  Carleman  uses,  since  it  does  not  add  to  the  physical  insight. 
I  do  think  though  that  an  extension  of  Carleman's  work  especially  to 
the  spatially  non-uniform  case  would  be  quite  valuable. 

(b)  The  proof  of  Boltzmann  does  not  mean,  that  the  approach  to 
equilibrium  happens  in  two  sharply  separated  successive  stages:  first 
the  approach  in  velocity  space  to  the  Maxwell  distribution  (9)  and  then 
the  approach  in  coordinate  space  to  the  **  barometric”  distribution  (1 1). 
Both  approaches  are  coupled  to  each  other  and  the  velocity  distribu¬ 
tion  will  be  strictly  Maxwellian  only  if  the  complete  equilibrium  is 
reached. 

(c)  However  it  is  true  that  the  approach  to  equilibrium  in  velocity 
space  is  quite  different  from  the  approach  to  equilibrium  in  coordinate 
space.  It  is  physically  plausible  and  it  is  also  suggested  by  Boltz¬ 
mann's  proof,  that  the  velocity  distribution  in  any  volume  element  of 
space  will  approach  a  local  Maxwell  distribution  of  the  form  (9)  very 
quickly  and  in  a  monotonic  fashion.  It  will  never  reach  it  because 
the  streaming  and  the  outside  forces  will  interfere,  but  it  will  have  this 
tendency.  On  the  other  hand  the  approach  to  equilibrium  in  co¬ 
ordinate  space  is  not  monotonic  in  general  and  will  require  a  much 
longer  time.  Of  course  this  picture  of  the  approach  to  the  complete 
Maxwell-Boltzmann  distribution  (6)  must  and  can  be  confirmed  by  a 
deeper  study  of  the  Boltzmann  equation.  It  is  the  basis  of  a  successive 
approximation  method  (the  Chapman-Enskog  development)  which 
we  will  discuss  in  Chapter  VI. 
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3.  The  linearized  Boltzmann  equation.  Accepting  the  fact  that  the 
Maxwell-Boltzmann  distribution  (6)  will  be  reached  in  time  it  is  clearly 
of  interest  to  investigate  the  case  of  a  small  disturbance  of  the  equili¬ 
brium  state.  Put  in  (I): 

(12)  /  =  /oil  +  Mr,  v,  /)] 

and  neglect  the  quadratic  terms  in  A.  One  then  gets  the  linearized 
Boltzmann  equation: 

Jt  =  "V  !r  ~  *  S  +  ldngHg,0)Mh'  +  A',  -  A  -  A,) 

(13) 

=  S(h)  +  C(h). 

Consider  first  the  spatial  uniform  case,  in  which  there  are  no  external 
forces,  so  that  the  disturbance  A  depends  only  on  v  and  /,  and 


(14) 


.  /  m  \3  2  /  mva\ 


where  the  number  density  n  and  the  temperature  T  are  constants. 
To  solve  the  initial  value  problem,  it  is  clearly  of  interest  to  find  the 
eigenfunctions  ti  and  eigenvalues  A,  of  the  collision  operator  C, 
defined  by 

(15)  C(ti)  =  A,0(. 

Some  properties  are  quite  obvious. 

(a)  There  are  five  zero  eigenvalues  corresponding  to  the  eigen¬ 
functions  I,  v,  r2;  this  is  a  consequence  of  the  five  conservation  laws 
during  a  collision. 

(b)  All  other  eigenvalues  must  be  negative. 

Proof. 


A,  = 


J^v/o0iC(</',) 

J  'ft/o<A? 


and: 


jdvf0tiC(ti)  =  -  1  fdv  Jdr,  j  dCighg,  e)f0f0M\  +  til  -ti-  til )2 

using  the  same  transformations  as  used  in  the  proof  of  the  //-theorem. 

(c)  Different  eigenfunctions  are  orthogonal  to  each  other.  This 
follows  from: 

J  JvfJKCiti)  =  J  dvfotiC(ti). 
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(d)  The  operator  C  is  an  isotropic  operator  in  velocity  space  and 
therefore  the  eigenfunctions  will  have  the  form: 

06)  <i>rlm  =  Rrl(v)Ytm(6,<t>) 

using  polar  coordinates  in  velocity  space,  and  the  eigenvalues  Arl  will 
at  least  be  (2/  +  l)-fold  degenerate. 

Assuming  that  the  <li,  form  a  complete  set,  one  can  clearly  solve  the 
initial  value  problem  in  the  spatially  uniform  case  by  developing 
/r(v,  t)  in  the  and  one  gets: 

(17)  h(y,  t)  =  V  r,eV0,(v) 

t 

where  the  ct  are  determined  from  h(\,0).  Since  A,  <  0,  (17)  shows 
that  h  goes  to  zero  except  for  the  permanent  changes  in  density, 
average  velocity  and  temperature.  The  quantities  1/A,  clearly  repre¬ 
sent  a  spectrum  of  relaxation  times. 

The  question  remains,  whether  such  a  discrete  spectrum  of  relaxation 
times  really  exists.  So  far  as  I  know  this  has  not  been  proved  in 
general.  One  does  know5  that  the  collision  operator  C(h)  is  of  the 
form: 

(18)  C(h)  =  -a(v)h  +  J </*,/„,*(»,  v,)/Kv,) 

but  the  kernel  tf(v,  y,)  is  singular,  so  that  the  usual  theory  is  not 
applicable.  It  is  still  simple  to  see  that  whatever  the  spectrum  is,  it 
is  bounded  from  below,  if: 

jdyf0a(v)  =  Jrfv  jd\J0f0t  j dQgl{g,0) 

exists,  which  implies  that  the  total  cross  section: 

(19)  Q(g)  =  jdSiHg,0) 

must  be  finite,  which  again  follows  from  the  physically  natural  assump¬ 
tion  that  the  forces  have  a  finite  range.  I  believe  that,  in  this  case, 
the  spectrum  will  be  discrete  and  that  the  A,  will  approach  a  finite 
negative  limit  point.  But  even  for  hard  spheres  (/(#,  0)  =  const.) 
this  has  not  been  proved.8 

Because  of  these  uncertainties,  it  is  reassuring  that  for  one  special 
case  the  spectrum  and  the  eigenfunctions  can  be  determined  explicitly. 
Since  for  this  case  we  will  discuss  in  the  next  chapter  a  special  applica¬ 
tion  of  the  linearized  Boltzmann  equation,  I  will  collect  here  the 
precise  formula. 
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Going  back  to  (13),  introduce  dimensionless  velocity  variables  by 
putting: 


and  similarly  for  »j.  Put  also: 

' - (ST '• 

then  with  a  =  0  (no  outside  fields),  so  that  f0  is  still  given  by  (14), 
the  linearized  Boltzmann  equation  becomes: 

(20)  ~  +  c  =  naj(h) 

07  Of 

with: 


(21 )  J(h)  =  ^  J\fc,  exp  ( -  c?)  J  d!i  Fig,  6)[h'  +  h\-h-  AJ, 


o  is  a  constant  of  the  order  of  the  total  cross  section,  which  is  intro¬ 
duced  to  make  Fig,  8)  dimensionless.  I  will  call  Jih)  the  dimension¬ 
less  collision  operator.  It  still  has  of  course  the  same  properties 
as  Cih). 

Now  assume  that  Fig,  8)  is  a  function  of  0  alone.  Since  the  relation 
between  the  velocities  c',  c',  after  collision  and  the  velocities  c,  Cj 
before  collision  is  a  linear  relation,  it  is  clear  that  in  this  case  if  A  is  a 
polynomial  of  degree  n  in  v  also  Jih)  will  be  a  polynomial  of  degree 
n  in  v.  The  eigenfunctions  of  J  will  therefore  be  polynomia,  and  in 
fact  one  can  verify  that  the  eigenfunctions,  written  again  in  polar 
coordinates  in  velocity  space,  are: 

(23)  =  NtlmSy>mic2)clYlmi0,  t) 

where  the  SyUa  are  the  associated  Laguerre  polynomia  (also  called 
Sonine  polynomia).  SV’M  is  defined  as  the  coefficient  of  tn  in 

(I  -  f)“n~'  exp|- 
and  their  explicit  expression  is: 


ST\x)  =  £  (-*)’ 

p«0 


jn  +  m)! 

p\im  -  p)\{n  +  p)\ 
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In  (23)  the  Nrtm  are  normalization  constants.  The  0rlm(c)  form  a 
complete  orthogonal  set  of  eigenfunctions  with  weight  function  e~e *, 
and  the  corresponding  eigenvalues  are: 


(24) 


\rt  =  2n  J*  dd  sin  0/'(d)[cos2,+l 


+  sina,+,|/>,^sin^  -  1  -  $ro$io]- 


Eqs.  (23)  and  (24)  were  found  first  by  C.  S.  Wang  Chang  and  then  also 
independently  by  L.  Waldmann.7  Note  that  the  five  zero  eigenvalues 
are  A00,  A01  and  A10  corresponding  to  the  eigenfunctions  1,  c  and 
(3/2  -  c2).  One  further  sees  from  (24)  that  the  Arl  converge  for  large 
r  and  /  to: 

(25)  -  2n  J*  d0  sin  0F{0) 


which  we  can  assume  to  be  equal  to  -  I  by  the  proper  choice  of  a. 

I  will  call  this  special  case  of  the  linearized  Boltzmann  equation  the 
quasi-Maxwell  model.  It  is  not  a  realistic  model,  in  the  sense  that 
there  is  no  intermolecular  potential  for  which  F(g,  6)  depends  only  on 
0  and  for  which  the  integral  (25)  converges.  When  the  force  is  equal 
to  */•" 5  ( Maxwell  model), 

/2k\'13 

gHg,0)  =  (^)  Fm(6) 


but  for  small  0,  FM(0)  -  0  5  2  so  that  (25)  diverges,  corresponding 
to  the  fact  that  Hr)  has  an  infinite  range. 8  However,  it  seems  to  me 
that  the  quasi-Maxwell  model  must  be  quite  similar  to  the  actual  case 
of  forces  with  finite  range,  and  will  give  therefore  at  least  a  qualitative 
insight. 

Finally  let  me  point  out,  that  in  (19)  the  order  of  magnitude  of  the 
eigenvalues  of  the  collision  operator  noJ(h)  is  clearly  no,  which 
corresponds  to  the  order  of  magnitude  no(kT/m)l'a  for  the  eigenvalues 
A,  of  C(h).  Thus  (A,)  - 1  is  of  the  order  of  the  time  between  collisions 
which  confirms  the  view  that  the  Maxwell  distribution  will  be  reached 
after  a  few  collisions,  that  is  in  a  time  of  the  order  of  the  time  between 
collisions. 
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1.  For  the  complete  discussion  Boltzmann's  book  Vorlesurtgen  iiber 
Gastheorie  (Ambrosius  Barth,  Leipzig,  1912)  is  still  indispensable. 
See  esp.  Vol.  1,  Part  II,  and  Vol.  2,  Part  VII. 

2.  One  has: 


("L -/*/*./**«*.  Ota/'W-#.) 

=  Sdy  J*>  I dQ  *Hg' 0)  ln/i  (ff{  ~ #i)> 

by  interchanging  v  and  v,.  Therefore,  adding  the  two  expressions  and 
dividing  by  two: 

(f  L,  -  5  h  h  / dsig,,g ■ >m!  -  ff'] " m 

=  \  Ith  /*>  fdQ  gH *' em  ~  rf[)  ,n  (/,/,)’ 

by  interchanging  the  velocities  before  and  after  collision  and  using 
again  </v</v,  =  dV  d\\.  Adding  the  two  expressions  and  dividing  by 
two  leads  to  the  form  given  in  the  text. 


3.  Write  Eq.  (9)  in  the  form: 


In/  =  - hi 3  +  kv  +  n 


so  that  h  =  Pm/2,  k  =  mjSu  and  n  =  In  A  -  pmu2/2.  Substituting  in 
(10)  and  requiring  that  (10)  must  be  fulfilled  identically  in  v  leads  to 
the  equations: 


(a) 


(b) 

(c) 

(d) 


dk,  8k,  -bh 
to,  +  to,  -  2  57  8"  “  "• 

f  +  *!  -  2ha,  -  0. 

8t  8x{ 

8n  .  . 

07  +  ^a<*a  ~  0. 
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From  (a)  and  (b)  one  concludes  that  h  can  only  be  a  function  of  the 
time  and  that 

(e)  k  =  k0(/)  +  (ft(/)  x  r]  +  —  r. 


At  this  stage  one  can  say  therefore  that  the  average  motion  of  the  gas 
can  only  be  a  possibly  time  dependent  combination  of  a  translation, 
rotation  and  a  radial  expansion  or  contraction.  Since  ma  = 
-grad  U(r),  it  follows  from  (c)  that: 

(f)  |  -  -grad(n  +  1 1/)- 

Therefore  the  curl  of  e>k/di  must  be  zero,  which  implies  that  ft  must 
be  a  constant  and  then  it  follows  from  (e)  and  (f)  that: 


n  +  ?£  U  =  n0(t )  - 

m 


1  dVt 

2  dt3 


Substituting  this  expression  for  n  in  the  last  equation,  (d),  which  has 
to  be  fulfilled,  leads  to: 


(8) 


I  ,  ,  ,,  ,  2  dh  ,, 

mk^U  +  mdiU~ 


dn0 

dt 


d3k0  _  1  <Ph  a 
dt 3  r  2  dt3  r  ■ 


Now  suppose  that  from  an  arbitrary  origin  the  outside  potential  U(r) 
can  be  developed  in  a  power  series: 

(h)  ’  U  =  Aaxa  +  ^  Aa„xax„  +  ^  AalltxaXgXr 


where  the  /4's  are  known  constants.  Higher  order  terms  will  be 
neglected  since  it  will  turn  out  that  with  arbitrary  >4's  the  form  (h)  has 
already  sufficient  generality.  Introducing  (h)  in  (g),  it  is  clear  that 
only  the  left  hand  side  will  contain  terms  of  the  third  degree  in  the  xt. 
Equating  the  coefficients  of  these  terms  to  zero  gives  ten  homogeneous 
linear  equations  in  the  four  unknowns  dh/dt  and  ft.  Thus  one  finds 
that  in  general  h  must  be  constant  and  ft  =  0.  From  the  terms  of 
the  second  degree  in  the  one  then  can  conclude  that  also  k0  =  0, 
so  that  k  is  zero  all  together.  From  (d)  follows,  that  then  n  must  be 
independent  of  the  time,  and  from  (c)  one  concludes  that  n  +  2Jtl//m  = 
n0  which  is  equivalent  to  Eq.  (II)  in  the  text. 

4.  T.  Carleman,  Problemes  Mathematiques  dans  la  Theorie  Cinetique 
des  Gaz  (Publications  Scientifiques  de  I'lnstitut  Mittag-Leffler,  Vol.  2, 
Uppsala,  1957).  The  generalization  of  the  proof  for  other  force  laws 
with  a  finite  range  is  straightforward. 
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5.  This  was  first  shown  by  D.  Enskog,  Kinetische  Theorie  der 
Vorgtinge  in  Massig  Verdtinnten  Gasen,  Dissertation,  Uppsala,  1917, 
p.  140.  For  a  more  transparent  proof  see  L.  Waldmann,  Handbuch 
der  Physik  12,  p.  366. 

6.  There  is  also  the  following  troublesome  question.  Should  one 
impose  the  condition  that  the  integral : 

(i)  jdyfoP 

exists  in  order  to  select  the  eigenvalues  and  eigenfunctions  of  the  linear 
collision  operator  C(0)?  On  the  one  hand  this  is  necessary  if  one 
wants  to  apply  the  usual  Hilbert  space  theory.  But  on  the  other  hand 
the  integral  (i)  has  no  physical  meaning  in  contrast  to  the  case  of 
the  quantum  mechanical  operators.  This  difficulty  was  noticed  by 
N.  G.  van  Kampen  (Physica  21  (1955)  949;  see  especially  the  remarks 
on  p.  957).  L.  Waldmann  (Handbuch  der  Physik  12,  p.  365  and  p.  367) 
has  tried  to  connect  the  square  integrability  condition  with  the  require¬ 
ment  that  the  entropy  should  always  exist.  However  his  argument  is 
not  convincing. 

7.  C.  S.  Wang  Chang  and  G.  E.  Uhlenbeck:  On  the  Propagation  of 
Sound  in  Monatomic  Gases.  Report  of  the  Engineering  Research 
Institute,  University  of  Michigan,  1952.  L.  Waldmann,  Handbuch  der 
Physik  12,  §38. 

8.  Explicitly  one  finds  (see  Maxwell,  Collected  Papers,  Vol.  II,  p.  42): 

r  m  =  1 _ (COS  2<f>)'  2 _ 

M  4  sin  0  sin  2 <f>  {cosa  <f>K( sin  <t>)  -  cos  2<££(sin  <f>)} 

where  <f>  is  related  to  0  by : 

—y  =  (cos  2<f>)'  2£(sin  <f>). 


K(x)  and  E(x)  are  the  complete  elliptic  integrals  of  the  first  and 
second  kind  respectively.  FM(0)  is  a  monotonic  decreasing  function 
of  0;  for  small  0: 


«^1J0' 


5i2/, 


+  Js#  + 


FmW  = 


1 

4Ka(w/4) 


=  0.0727- 


while  for  0  =  »r: 


Chapter  V 


The  Propagation  of  Sound 


1.  Introduction.  As  an  application  of  the  linearized  Boltzmann 
equation  let  me  consider  the  propagation  of  sound  in  a  gas.  Clearly, 
if  the  gas  is  in  equilibrium  and  if  the  intensity  of  the  sound  is  not  too 
high,  then  the  sound  waves  will  be  a  small  disturbance  from  equili¬ 
brium  and  should  therefore  be  described  by  the  linearized  Boltzmann 
equation. 

First  let  me  tell  you  some  of  the  experimental  facts.  .  The  first 
question  is  how  fast  sound  propagates,  and  the  answer  seems  without 
mystery  and  is  known  to  every  physics  student.  He  knows  that  the 
velocity  V0  depends  on  the  adiabatic  compressibility  of  the  gas  and 
that  one  gets: 


/££*ry/a  = 

/5  kTV 

\r„  m  ) 

\3  mj 

for  a  mono-atomic  gas,  to  which  I  will  always  restrict  myself.  How¬ 
ever,  if  one  does  not  look  upon  the  gas  as  a  macroscopic  material  with 
some  compressibility,  but  as  a  collection  of  almost  independent 
molecules,  then  it  seems  to  me  that  the  sound  propagation  becomes 
more  mysterious.  How  is  it  possible  to  impose  on  the  random  motion 
of  the  molecules  the  ordered  motion,  or  in  present  day  terminology 
the  collective  mode  of  motion,  which  a  sound  wave  represents?  That 
one  runs  into  rather  deep  questions  becomes  apparent,  when  one 
wants  to  find  the  frequency  dependence  of  V,  the  dispersion  of  sound, 
and  the  accompanying  absorption  of  sound.  Since  the  days  of 
Kirchhoff  one  knows  that  for  increasing  frequency,  V  becomes  larger, 
and  absorption  sets  in.  The  latter  can  still  be  partially  understood 
from  the  macroscopic  point  of  view.  At  high  frequency  the  tempera¬ 
ture  and  velocity  gradients  between  compression  and  rarefaction 
become  so  steep  that  although  the  time  between  them  gets  shorter,  the 
heat  conduction  and  the  viscosity  of  the  gas  begin  to  have  effect  and 
cause  absorption.  Kirchhoff  found  for  the  absorption  coefficient  in 
amplitude : 
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(2) 


“0 


_  a£_  (2 

Pyi  u 


p  + 


—  —\ 
15  k  ; 


where  to  -  angular  frequency,  p  the  density,  P  the  viscosity  coefficient 
and  k  the  heat  conduction  coefficient  of  the  gas.  Note  that  since 
k  ~  (A//h)p,  p  ~  pA^r)  with  A  the  mean  free  path,  <r>  some  average 
thermal  velocity  (hence  '  <■>  ~  V0 ),  (2)  shows  that  a0  ~  X/A 3  if  A  is 
the  wave  length  of  the  sound.  Experimentally  (2)  has  been  well 
verified,  when  the  frequency  to  is  not  too  high. 

What  happens  if  the  frequency  gets  higher  and  higher?  The  most 
extensive  experiments  have  been  done  by  M.  Greenspan  for  helium 
gas.1  His  method  was  the  usual  one,  although  many  precautions  and 
refinements  were  necessary  of  course.  A  piezoelectrically  driven 
quartz  plate  A  produced  the  sound  waves  and  the  density  fluctuations 
were  picked  up  at  an  identical  receiver  quartz  plate  B  producing 
voltage  fluctuations  which  then  could  be  amplified.  The  difference  in 
phase  of  the  fluctuations  between  A  and  B  determines  the  phase 
velocity  and  the  ratio  of  the  amplitudes  gives  the  absorption  coefficient. 
The  distance  L  between  the  plates  was  always  much  larger  than  the 
mean  free  path  A.  The  results  seem  to  indicate  that  the  velocity  V 
and  the  absorption  coefficient  « increase  monotonically  with  frequency. 
The  highest  value  of  A//1  which  could  be  realized  was  of  order  one 
and  the  velocity  V  had  then  increased  to  about  3  V0,  while  the  absorp¬ 
tion  coefficient  had  become  of  order  I /A,  so  that  the  absorption  was 
tremendous. 

What  happens  for  tu  -  or.  ?  Some  people  have  speculated  that  V 
and  ft  might  become  infinite,  since  practically  infinite  velocities  occur 
according  to  the  Maxwell  distribution.  Others  have  thought  that  V 
might  become  constant  again  while  the  »  goes  through  a  maximum, 
similar  to  the  observed  behaviour  in  poly-atomic  gases.  But  there  is 
no  real  reason  for  either  belief. 


2.  The  Wang-Chang  method.  Returning  now  to  the  theory,  to  find 
the  dispersion  law  of  the  sound  waves  it  seems  natural  to  seek  a  solu¬ 
tion  of  the  linearized  Boltzmann  equation  (see  Chapter  IV,  §3): 


(3) 


ih  i'h  . 

_  +  c  -  -  mm. 


J(h) 


^4  Jr/c.e-'fJ dQ  F(g,  8)[h’  +  h{  -  h  -  AJ 


which  has  the  form  of  a  plane  wave,  say  in  the  r-direction.  Put 
therefore: 


(4) 


h  =  exp[/(Arz  -  <o0t)]/j0(c) 
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where  a»0  ■»  w(ml2kT)lia  and  k  =  2n/A  is  the  wavenumber  (not  to  be 
confused  with  the  Boltzmann  constant  k  which  occurs  always  in  the 
combination  kT).  Now  develop  M c)  in  the  eigenfunctions  of  J(h ): 


Me)  =  2  Mi(c). 

I 

One  clearly  gets  an  infinite  set  of  linear  equations 

(5)  ik  ^  ~  i<^oa,  =  na\tat 

t 

with: 

(6)  M,m  = 

The  equations  (S)  will  have  a  non-trivial  solution  only  if  the  infinite 
determinant  of  the  coefficients  vanishes.  This  condition : 

(7)  \ikMln  -  (iw, o  +  naX,)6,J\  =  0 

gives  a  relation  between  u>0  and  k  and  hence  expresses  the  dispersion 
law  of  the  gas.  However,  the  discussion  is  hard.  All  that  we  could 
think  of  was  to  break  off  the  determinant  successively.  It  is  simple 
to  see  that  taking  only  the  first  three  rows  and  columns,  corresponding 
to  the  three  zero  eigenvalues  (for  which  the  eigenfunctions  are  1, 
c.  and  c2  -  3/2,  so  that  the  corresponding  elements  of  M,m  are  simple 
to  evaluate),  leads  to  the  cubic  equation: 

<*o(^  -  l  *2)  =  o 

with  the  three  roots  o»0  =  0,  <o0  -  ±k( 5/6) 12.  The  last  two  roots 
clearly  represent  “propagating"  modes  going  to  the  right  and  left 
with  the  velocity: 


y0 


OJ 

“0  i 

/2A7V'2 

/sm 

k  = 

T1 

h  rn) 

so  that  we  have  derived  Eq.  (I).  By  taking  more  rows  and  columns 
one  gets  higher  order  algebraic  equations  in  <u0,  and  by  then  following 
the  propagating  modes  (that  is  the  roots  which  for  k  ->  0,  become 
±fc(5/6),/a)  one  can  see  what  happens  for  larger  k  or  smaller  wave¬ 
lengths.  This  is  straightforward  for  the  quasi-Maxwell  model,  and 
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one  obtains  series  expansions  for  the  velocity  V  and  the  absorption 
coefficient  «  of  the  form : 


(8) 


V  = 


a  = 


K*>[' +  +““fe) +  ]• 


where  the  numerical  coefficients  a,  and  bt  depend  successively  on  the 
eigenvalues  A,  ordered  according  to  their  absolute  magnitude. 

Unfortunately  we  could  not  discuss  the  convergence  of  this  pro¬ 
cedure  and  so  one  can  not  get  an  answer  to  the  question  what  happens 
at  high  frequencies  for  which  ut/no  Vn  is  of  order  one.  In  fact  the 
coefficients  a{  and  />,  increase  rather  rapidly,  so  that  numerically  the 
convergence  is  bad.2 

3.  The  Kac  successive  approximation  idea.  What  is  needed  is  a  more 
precise  discussion  of  the  experimental  situation  as  sketched  in  the 
beginning  of  this  chapter,  and  which  should  clearly  be  formulated  as  a 


Figure  I 

boundary  value  problem.  However,  it  is  simpler  to  think  of  the 
initial  value  problem— given  at  /  =  0  the  disturbance  Mr,  v,  0),  find  h 
at  any  later  time — which  also  should  provide  an  insight  into  the 
behavior  of  the  collective  modes  of  motion  of  the  gas.  For  the  actual 
equation  (3)  we  have  not  yet  succeeded  to  find  the  solution,  but  it  is 
possible  to  solve  the  problem  for  a  simplified  version  of  (3)  which  is 
due  to  Mark  Kac.3 

One  can  always  write  (suppressing  r  and  /): 

J{h)  =  fdc,  L(c,  c,)Mc,) 

(9) 

with:  Me,  c,)  =2,Al^(cW-((c,)exp(-c?). 

1 

Think  especially  of  the  quasi-Maxwell  model  for  which  the  spec¬ 
trum  is  shown  again  in  Fig.  I.  Put  now  in  (9)  as  a  kind  of  zeroth 
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approximation  all  A,  =  -1,  except  the  S  zero  eigenvalues.  In  first 
approximation  one  would  then  keep  the  first  negative  eigenvalue  and 
collapse  all  others  in  -  I ,  and  so  on.  At  every  stage  one  can  solve  the 
initial  value  problem  and  one  can  also  discuss  in  detail  all  the  normal 
modes,  because  the  linearized  Boltzmann  equation  now  becomes  an 
integral  equation  with  a  degenerate  kernel.  In  fact  using  the  com¬ 
pleteness  relation  of  the  one  gets  in  zeroth  approximation  from  (3) 
and  (9): 

(10)  ^  +  nah  +  c  ~ 
cr  df 

=  pra  J*  1  exp(-c?)[l  +  2c  c,  +  ^c2  - 

and  this  should  also  be  the  zeroth  approximation  for  any  molecular 
model  for  which  the  range  of  the  forces  is  finite. 

To  simplify  the  algebra  as  much  as  possible  I  will  discuss  instead  of 

(10) ,  the  equation: 

(11)  +  Yh  +  f  =  (2^p  j  „  dc'  exp(-r?/2Xl  +  rc, )/*<•,) 

which  is  a  kind  of  one-dimensional  version  of  (10).  It  is  related  to  a 
one-dimensional  Boltzmann-like  equation,  which  was  also  proposed 
by  M.  Kac  and  it  can  be  obtained  by  a  mutilation  of  the  linearized 
form  of  this  equation.4  The  mathematics  of  (10)  and  (1 1)  are  quite 
similar  and  also  the  results  are  similar,  but  everything  is  much  simpler 
for  (II). 

Since  in  (II)  the  coefficients  are  independent  of  x,  one  can  always 
make  a  Fourier  transformation  in  x.  Put: 

hk(c,  r)  =  J  dx  e  ikIh(x,  c,  r); 

then  (II)  becomes : 

(12)  ^  +  (y  +  ikr)hk  =  J  dci  exp(-r?/2)(l  +  cc,)/ik(ci,  r). 

To  solve  the  initial  value  problem  one  can  use  two  methods: 
(I)  the  Laplace  transform  method  in  r,  which  brings  in  immediately 
the  initial  value  of  hk ,  or  (2)  the  method  of  normal  modes ,  in  which  the 
initial  distribution  must  be  developed  (provided  the  normal  modes 
form  a  complete  set)  and  from  which  the  further  temporal  develop¬ 
ment  then  follows  immediately. 
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Of  course,  if  you  do  it  right  the  two  methods  will  always  give  the 
same  answer.  1  will  give  some  details  of  both  methods,  mainly 
because  for  the  quite  analogous  problem  of  plasma  oscillations  there 
has  been  some  confusion  in  the  literature  which  only  recently  has  been 
completely  dispelled  by  K.  M.  Case  (Ann.  Phys.  7  (1959)  349).  Before 
embarking  on  this,  note  that  from  (11)  still  follows  by  multiplying 
with  e~'2'a  and  integrating  over  c : 


(13) 

with 


Or  Ox 


0 


(14) 


p(x ,  r)  =  f  dceci'ah(x,  c,  r), 

J  -  00 

<Kx.  r)  =  f  rfff-'1  ach(x,  C,  r) 


or  after  a  Fourier  transformation  in  x  in  obvious  notation: 


(15) 


dPk(r) 

dr 


+  Hcf>k(T)  -  0. 


Eq.  (13)  is  a  kind  of  continuity  equation  or  macroscopic  equation  for 
this  model. 


4.  The  Laplace  transform  method.  Put  now : 


hkp(r) 


dr  e~  r'hk(c,  r); 


then  one  gets  from  (12): 

(16)  (p  +  y  +  ihc)hk„  =  hk(c,  0)  +  ^Jyia  (fap  +  c4>kP)- 
But  from  (15)  follows: 


•t>kp  = 


]_ 

ik 


(MO)  -  ppkv). 


Put  this  in  (16),  then: 

,  =  hk(c,  0)  yp»(0) _ c  YPkp  1  -  cpjik 

kr  P  +  Y  +  ike  ik(2n)lia  p  +  y  +  ike  (2n)l  ia  p  +  y  +  ike 
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| 

i 


Multiply  with  exp(  -  ca/2)  and  integrate  over  c.  One  then  gets  a  trap 
for  p, ,p  and  one  finds: 


(18) 

with: 


Pkp  ~ 


1 


G(p,  *)J- 


kk(e,0)  +  (.^2w)i/a  PdQ) 


dc 


P  +  y  + 


ike 


c*/a 


G(p,  k)  =  1  - 


—1 —  f 

(2jt)1,s  .I-® 


dc 


1  -  cpjik 
p  +  y  +  ike 


e 


-cJ/a 


This  put  in  (17)  determines  hkp  completely  in  terms  of  the  initial  data 
hk(v,  0)  and  pk(0).  In  fact  once  p(x,  r)  is  known  by  inversion  of  (18), 
it  is  quite  easy  to  invert  (16)  and  one  finds: 


09) 


h(x,  c,  r)  =  <?-  ylh(x  -  CT,  V,  0)  +  J  dr'  e- 


x  [p(x  —  cr',  r  —  t')  +  c^(x  —  cr'. 


r')]. 


This  shows  that  the  influence  of  the  initial  distribution  damps  out 
in  time  and  that  for  large  r  the  distribution  function  h(x,  c,  r)  depends 
only  on  the  “macroscopic"  quantities  p  and  <f>. 


5.  The  normal  mode  method.  This  method  is  perhaps  more  physical, 
and  anyway  it  is  more  familiar.  Go  back  to  ( 1 2)  and  seek  solutions  of 
the  form: 

hk(c,  r)  =  hulk(c)e~,,:‘'. 


The  function  hok(c)  is  the  eigenfunction  or  normal  mode  corresponding 
to  the  eigenfrequcncy  u>.  One  gets  from  (12): 


(20)  (-/«>  +  y  +  ikc)huk  =  ^J)'  3  (Pu,k  + 
and  from  (15): 

(21)  =  k<f>uk. 

Because  the  equation  (20)  is  homogeneous  in  huk,  one  can  always 
normalize  huk  so  that: 


(22)  Pulk  =  I"0  dee  '2"hwk(e)  =  I 

J  -  no 

and  then  from  (20)  and  (21)  one  gets: 


(23) 


(-iw  +  y  +  ike)hlok 
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Now  one  has  to  watch  out!  From  (23)  one  can  only  conclude: 


(24) 


h„k(c)  =  + 


y  I  +  wc/k 
(2n)lia  ike  —  iio  +  y 


where  A  is  arbitrary.  To  find  the  spectrum  of  eigenfrequencies  <u, 
one  has  to  distinguish  two  cases : 

(a)  e  —  (u>  +  iy)jk  is  real.  Then  from  the  normalization  condition 
(22)  one  gets: 


I  =  Ae'3'3  + 


ik(2n) 


dc 


1  +  utc/k  ,, 


c  -  c 


i -  e~ 


where  the  principal  value  of  the  integral  is  meant.  This  determines  A 
and  hence  for  fixed  real  k,  there  is  a  continuum  of  complex  eigen¬ 
frequencies  all  with  imaginary  part  equal  to  -/y.  For  these  modes 
one  can  not  speak  of  a  dispersion  law  since  there  is  no  relation  between 
w  and  k. 

(b)  c  is  not  real.  Then  since  c  is  real,  one  can  omit  in  (24)  the 
6-function.  From  the  normalization  condition  one  then  finds: 


(25) 


I  = 


ik(2ir) 


1  13 


dc 


I  4-  tac/k  e_c2i2 
c  -  c 


The  right-hand  side  is  an  analytic  function  of  <«  (always  for  fixed  real  k) 
everywhere,  except  on  the  line  where  the  imaginary  part  of  w  is  -iy. 
Hence  (25)  can  only  be  fulfilled  for  a  discrete  set  of  values  of  w  (which 
may  of  course  be  the  null  set)  for  each  value  of  k.  For  these  discrete 
modes  there  is  therefore  a  dispersion  law,  and  they  are  the  collective 
modes  among  which  there  must  be  the  sound  propagation. 

The  connection  between  the  two  methods  is  clear.  The  modes  are 
the  poles  in  the  /?- plane  in  the  expression  (18)  for  the  Fourier-Laplace 
transform  of  the  density  p.  Especially  Eq.  (25)  is  just  the  equation 
G(  ~iw,  k)  =  0.  I  will  therefore  not  further  elaborate  on  the  deriva¬ 
tion  of  the  solution  of  the  initial  value  problem  by  means  of  the 
normal  modes. 

Because  in  this  simple  case  the  Eq.  (25)  for  the  discrete  modes  is 
quite  explicit  and  simple,  it  is  possible  to  find  the  discrete  modes  and 
to  follow  them  as  functions  of  A:.  Put: 


;  =  wjk,  x  =  y/k,  £ 

F({) = (w*  !-»dce 


=  z  +  ix, 

-  ca  13 


C-  £’ 
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then  the  transcendental  equation  (25)  can  be  written  in  the  form: 

(26)  H(x,  {)  ■  *a{£F(0  +  1}  +  te{«*  +  l)F(0  +  £}+l=0 
and  one  can  prove  (I) 

(27)  H*(x,  0  =  H(- x,  {*)  =  H(x,  - {*) 

where  the  star  denotes  the  complex  conjugate. 

(2)  For  x  >  0  (that  is  k  >  0)  H(x,  C)  has  no  roots  in  the  lower 
half  plane,  and  zero  or  two  roots  (depending  on  the  value  of  x)  in  the 


lm  £ 


Im  (J 


upper  half  plane.  Because  of  (27)  the  two  roots  (if  present)  will  have 
the  same  imaginary  part  and  opposite  sign  for  the  real  part.  They 
correspond  to  a  damped  sound  wave  going  to  the  right  and  to  the  left. 
For  x  <  0,  that  is  k  <  0,  one  gets  because  of  (27)  the  complex  con¬ 
jugate  solutions. 

(3)  For  x  large  (or  k  small,  or  long  wave  length)  one  always  has 
the  two  roots,  and,  say,  for  the  one  in  the  first  quadrant  one  gets  the 
asymptotic  series: 


z  =  1  - 


J  +  2x2 


Putting  u>  =  to,  -/«,  this  leads  to  the  dispersion  law 

+  '  • 

ty- 

and  an  absorption  coefficient : 


V  =  ^  =  i  +  *1 

k  +  2y2 
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which  are  similar  to  the  series  expansions  (8)  and  which  are  in  fact 
identical  with  what  one  would  obtain  by  applying  the  Wang-Chang 
method  to  Eq.  (II). 

(4)  For  decreasing  x  (or  decreasing  wave  length)  V  and  a  increase 
monotonically  till  x  -  Jtcrlt  =  0.724-  •  For  x  <  jfcrit  there  are  no 
more  discrete  modes.  The  paths  of  the  two  roots  are  shown  in  Fig.  2 
in  the  (-plane  and  in  Fig.  3  in  the  a»plane.  The  velocity  increases 
about  10%  and  the  damping  increases  until  it  has  become  as  large  as 
for  the  continuum  modes.  A  similar  discussion  can  be  carried  out 
for  the  more  realistic  three-dimensional  equation  (10).  The  normal 
mode  spectrum  consists  of  a  continuum  for  which  the  imaginary  part 
of  <i»0  is  equal  to  -no,  and  three  discrete  modes  if  x  =  no/k  is  suffi¬ 
ciently  large.  For  x  -*■  oo,  these  modes  correspond  to  z  -  <o0,  k  —  0, 
±(5/6)'  a,  which  are  the  same  as  found  by  the  zero  approximation  of 
the  Wang-Chang  determinant  (7).  For  smaller  x  or  shorter  wave 
length  one  obtains  series  expansions  like  (8)  for  the  propagating 
mode  but  now  one  can  prove  that  these  series  are  asymptotic.  The 
third  mode,  that  is  the  one  for  which  r  becomes  zero  for  x  -»  oo, 
always  is  purely  imaginary  and  is  not  a  propagating  mode.  It  can 
perhaps  be  called  a  diffusion  mode.  There  is  again  a  critical  value  of 
x  for  which  the  damping  of  the  propagating  modes  has  become  no 
so  that  they  then  merge  with  the  continuum.  The  paths  of  the  discrete 
zero's  seem  more  complex  and  need  further  investigation.  It  may 
be  that  V  as  function  of  k  will  go  through  a  maximum  before  the 
critical  value  of  k  is  reached. 

From  the  physical  point  of  view,  it  seems  to  me  that  the  qualitative 
features  of  the  results,  namely :  the  existence  of  the  continuum  (or  of 
single  particle  modes),  sound  as  one  of  the  discrete  modes,  the  existence 
of  a  critical  frequency  for  the  sound  modes  are  probably  general  and 
are  physically  reasonable  consequences  of  the  existence  of  a  total 
cross-section;  that  is,  of  a  lower  bound  of  the  spectrum  of  relaxation 
times.  I  am  very  hopeful  that  it  will  lead  to  a  real  explanation  of 
what  sound  is  at  very  high  frequencies  and  will  help  to  explain  the 
Greenspan  experiments  and  perhaps  will  suggest  other  experiments. 


Notes  on  Chapter  V 


1.  M.  Greenspan,  J.  Acoust.  Soc.  Am.  22  (I9S0)  568.  In  the 
actual  experiments  the  frequency  of  the  sound  was  kept  fixed  and  the 
mean  free  path  A  was  varied  by  changing  the  pressure  of  the  Helium 
gas.  Since  the  relevant  parameter  is  X/A  this  is  equivalent  to  changing 
the  frequency  while  keeping  the  pressure  of  the  gas  constant.  The 
restriction  to  mono-atomic  gases  should  be  emphasized.  For  poly¬ 
atomic  gases  there  is  an  additional  cause  for  the  dispersion  and 
absorption  of  sound,  which  is  due  to  the  lag  in  the  transfer  of  the 
translational  energy  into  internal  energy,  and  which  usually  completely 
masks  the  dispersion  of  sound  with  which  we  are  concerned  here. 

2.  For  the  detailed  discussion  we  refer  to  the  report  cited  in  Note  7, 
Chapter  IV.  The  calculations  were  done  for  the  real  Maxwell  model. 
Since  in  this  case  there  is  no  total  cross-section  o,  another  form  for 
the  dimensionless  parameter  of  order  X/A  in  the  development  is  needed. 
We  chose  put/pVl,  where  p.  is  again  the  viscosity  coefficient.  The 
results  were: 


which  shows  how  fast  the  coefficients  increase. 

3.  None  of  this  work  has  as  yet  been  published.  A  joint  publica¬ 
tion  is  in  preparation. 

4.  The  one-dimensional  Boltzmann-like  equation  proposed  by 
M.  Kac  (Proceedings  of  the  Third  Berkeley  Symposium  on  Mathe¬ 
matics,  Statistics  and  Probability,  Vol.  3,  pp.  171-193)  has  the  form: 

% + v  fx = "  rr dvi  r*  * m{f(*'  ^  ^  ^  ° 


-  f(x,  V,  t)f(x,  vu  /)} 
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where  v'  =  v  cos  6  +  t>,  sin  6,  v[  =  - 1;  sin  6  +  t>,  cos  0  and  where 
F(0)  =  F(-0)  is  a  dimensionless  “collision”  probability.  Just  as 
the  real  Boltzmann  equation  it  leads  to  an  equilibrium  distribution : 

/o»  Ae-*"1 


and  by  linearizing,  putting  /  =  /0( I  +  h(x,  v,  r»,  one  obtains: 


(b) 


th  i)h  ... 
Fr*CTx~  " AJih) ’ 


7(A) 


=  f  +  *  7c,  exp(-c?/2)  f + *  70  F(9)(h'  +  A',  -  A  -  A,) 
J  -  ®  J  -  * 


with  c  =  t<2/9)*  2  and  r  =  /( 2/S)1'3.  It  is  easy  to  show  that  the  eigen¬ 
functions  of  the  dimensionless  linear  operator  7(A)  are  the  Hermite 
polynomials  HJc)  (also  sometimes  written  Hen(c),  since  the  weight 
function  is  exp(  —  ra/2))  and  that  the  corresponding  eigenvalues  are: 


(c)  A,  =  (2 :»>»  2  J  70  F(m cos  •)»  +  (-  l)"(sin  0)"  -  1  - 


In  terms  jf  the  eigenfunctions  one  can  write  7(A)  in  the  form: 
(d)  7(A)  =  f  7c,  L(c,  c,)A(c,) 

J  -  ao 

with: 


Ut,  c,)  =  £  n,(2A;)1;a  Hn(c)Hn(^)  exp(-c?/2) 

which  is  similar  to  Eq.  (9)  of  the  text. 

The  mutilation  of  the  linearized  equation  starts  from  the  form 
(d)  of  7(A).  Putting  A0  =  A,  =  0  and  all  other  eigenvalues  equal  to 
-  y/vA  leads  to  Eq.  ( 1 1 )  of  the  text.  It  should  be  pointed  out  that  this 
mutilation  is  not  analogous  to  what  was  done  in  the  three-dimensional 
case,  because  from  (c)  follows  that  A0  =  Aa  =  0,  while  all  other 
eigenvalues  are  negative  and  converging  to: 

-(2* )'  *  j**  d6  F(d). 

In  the  Kac  successive  approximation  method  applied  to  (d)  one  would 
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put  therefore  in  zeroth  approximation  A0  =  Aa  *  0  and  all  other 
eigenvalues  equal  to  -yI*>A.  This  leads  to  the  equation: 

»h  k  .  ,  dh  y  C*m  j  ^ 

57  +  >,*  +  c5J-(2^J..  exp<-c?/2) 

x  |l  +  i(c“  -  IKc?  -  1)W,). 

However  it  can  be  shown  that  this  equation  does  not  lead  to  collective 
modes  which  are  similar  to  sound. 


Chapter  VI 


The  Chapman-Enskog  Development 


1.  Introduction.  We  now  return  to  the  general  non-linear  Boltzmann 
equation: 

(l)  t  +  *■%+•■%-  f*xfdOgng'Wfi-ffi) 

discussed  in  Chapter  IV,  and  we  will  consider  the  general  problem  of 
how  the  usual  macroscopic  description  of  the  gas  in  terms  of  the 
hydrodynamical  equations  is  contained  in  the  Boltzmann  equation  (1). 
This  problem  has  a  long  history  and  both  Maxwell  and  Boltzmann 
have  given  it  a  great  deal  of  attention.  The  first  completely  satis¬ 
factory  solution  was  given  in  the  dissertation  of  Enskog  in  1917,  which 
was  inspired  by  Hilbert,  and  independently  by  Chapman,  who  used 
another  method  which  goes  back  to  Maxwell  but  which  led  to  the 
same  conclusions.1  I  will  only  try  to  sketch  this  theory,  and  I  want 
to  emphasize  the  mathematical  features,  which  I  think  are  quite 
curious  and  fundamental.  From  the  physical  point  of  view,  this 
theory  constitutes  one  of  the  few  successful  descriptions  of  a  class  of 
non-equilibrium  phenomena  in  terms  of  the  intermolecular  forces. 

2.  The  general  conservation  laws.  The  first  step  is  to  derive  from 

(1)  the  so-called  transport  equation  for  a  quantity  ^(v)  which  a  mole¬ 
cule  can  carry  “on  its  back".  Defining: 

(2)  t)  =  7 - 

dyf 


where  rH r,  t)  is  the  number  density,  one  gets  from  (1)  by  multiplying 
with  <p  and  integrating: 


j,  <"*>  +  w.  (m-*> 


na, 


dip 

advl 


=  1  jdy  jd>t  j dOgHg,  W  +  fi)[/7i'  -  ffx) 
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(3) 
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where  in  the  collision  term  the  same  transformations,  used  in  the  proof 
of  Boltzmann's  //-theorem  (see  Chapter  IV,  Note  2)  are  used  again. 
Take  now  for  0(v)  successively  the  five  quantities  1,  v,  va  which  are 
conserved  in  a  collision.  For  these  the  right-hand  side  of  (3)  vanishes 
and  after  a  few  simple  transformations  one  can  write  the  resulting 
five  equations  in  the  form: 


(a) 


(4)  { 


(b) 


(c) 


bp  f>  . 

bi  +  Wa(pUa)  =  °. 

Du,  _  / du ,  bu,\  8P ta 

p-Dl  =  p\-bI  +  UabTj  =  p<1‘--b^ 


R(Q 

'  Dl\p 


(5)  +  = 

\  P  /  VX' 


-  Pan  D, 


afi  L/ai I 


where  we  have  used  the  index  notation  with  the  usual  summation 
convention,  and  where: 


pi  r,  0  =  mn 

«i(r,  /)  =  f, 

U,  =  r,  -  u, 

Q(r,  l)  =  \  pU~2 


=  the  mass  density: 

=  the  average  velocity; 

=  the  thermal  velocity; 

=  the  thermal  energy  density; 


P{,(t .  /)  =  pU,U,  =  the  pressure  tensor  (obviously 

symmetric); 

qt( r,  t)  =  ^  pU,U 2  =  the  heat  current  density; 

Di,  =  1  i^r-1  +  —  the  rate  of  strain  tensor. 

2  \bx,  bx,l 


The  equations  (4)  express  the  conservation  laws  of  mass,  momentum 
and  energy.2  They  are  a  rigorous  consequence  of  the  Boltzmann 
equation,  but  of  course  they  are  still  an  empty  frame,  since  we  do  not 
know  how  to  express  the  P„  and  the  q,  in  terms  of  the  average  flow 
velocity,  the  density  and  the  temperature.  Or,  in  other  words,  one 
still  has  to  derive  the  Newton  and  Fourier  phenomenological  laws  for 
the  friction  and  the  heat  conduction. 


3.  The  Chapman-Easkog  development.  To  do  this  one  has  somehow 
to  solve  the  initial  value  problem  for  the  Boltzmann  equation,  since  to 
find  the  average  values  P„  and  qt  one  must  know  the  distribution 
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function/.  Now  we  discussed  already  in  Chapter  IV  that  the  approach 
to  equilibrium  proceeds  so  to  say  in  two  stages.  Because  of  the 
collisions  any  initial  distribution  will  reach  very  quickly  (in  a  time  of 
the  order  of  the  mean  free  time  t0  ~  A/(^r/m)1,a)  a  local  Maxwell 
distribution : 


where  the  macroscopic  variables  n,  u  and  T  are  still  functions  of  r 
and  t.  In  the  second  stage  the  slow  relaxation  of  the  macroscopic 
variables  to  their  equilibrium  values  takes  place,  and  it  is  this  stage 
which  we  want  to  follow  in  more  detail.  Now  the  argument  goes  as 
follows: 

I.  First  note  that  if  one  uses  (S)  to  calculate  Pu  and  qt  one  gets: 

9.  =  0,  Q  =  \p, 

(6) 

P„  =  pS(/  with  p  =  nkT 

so  that  (4)  becomes: 


(7) 


(a)  ^  +  div  (pu)  =  0, 

...  Du 

(b)  = 

(C)  §,(pT-32)  =  0 


grad  p. 


which  are  the  ideal,  or  Euler  hydrodynamical  equations  with  adiabatic 
temperature  changes  and  with  an  equation  of  state  which  is  still  the 
ideal  gas  law. 

2.  Of  course  the  two  stages  of  the  relaxation  to  equilibrium  are  not 
sharply  separated,  so  that  one  must  not  expect  that  after  a  time  of 
order  t0  the  distribution  function  is  precisely  of  the  form  (S).  Instead 
it  seems  more  reasonable  to  put  in  Eq.  (I): 


(8) 


/  =  /"”[  I  +  <Wr,  r,  /)] 


and  to  keep  in  the  collision  term  only  terms  linear  in  <f>.  Finally, 
since  it  is  the  streaming  term  in  (I)  acting  on  n,  u  and  T  which  causes 
the  deviation  <f>  from  the  local  Maxwell  distribution  (5),  it  is  sufficient 
to  put  only  /'0)  in  these  streaming  terms.  Thus  one  is  led  to: 


(9) 


G 


+  c 


b 

:  dxa 


+ 


_0 

bv„ 


)/,0) 


new 
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where: 

CW  =  J (hrfr  fdOgKg,  m  +  K  -  +  -  *1 1 

is  the  same  linearized  collision  operator  introduced  in  Chapter  IV. 
Note  however  the  different  way  in  which  one  now  has  linearized  the 
Boltzmann  equation.  The  resulting  equation  (9)  is  an  inhomogeneous 
linear  integral  equation,  in  contrast  to  the  homogeneous  equation 
discussed  in  the  previous  chapter. 

3.  One  now  runs  into  the  following  trouble.  The  homogeneous 
equation  C(<f>)  =  0  has  the  five  solutions  I,  v,  and  v 2  (which  I  will 
call  i/i,),  and  therefore  (9)  has  in  general  no  solution.  The  solubility 
conditions  are  that  the  left-hand  side  of  (9)  must  be  orthogonal  to  the 
0,  and  these  five  conditions  turn  out  to  be  the  Euler  equations  (7)  for 
n,  u  and  T  with  p  =  nkT.3  These  equations  allow  us  to  express  the 
time  derivatives  of  the  macroscopic  variables  in  terms  of  their  spatial 
derivatives  and  doing  this  one  then  finds:4 


(10) 


(—  +  t,  _5L  +  a  _IL\  /“>>  =  y<°>f_L  - L.  u  (JL L  u2  - 

\r>/  "  i)xa  a  fit  }  lr  fixa  2/ 


where  Ut  =  i>,  -  u ,  is  again  the  thermal  velocity.  With  (10)  for  the 
left-hand  side  of  (9),  this  equation  has  a  solution  of  course,  which  is 
also  unique  if  we  require  that: 


(11) 


=  0 


which  means  physically  that  the  macroscopic  variables  must  always  be 
determined  from  /<0>  alone. 

4.  With  the  solution  for  <f>  one  then  can  calculate  the  pressure 
tensor  P, ,  and  the  heat  flux  q,  for  which  one  finds  the  well-known 
Newton  and  Fourier  laws: 


(12) 


but  with  known  values  for  the  viscosity  and  heat  conduction  coefficients 
P  and  x.  Putting  (12)  into  the  general  conservation  laws  (4)  then 
leads  to  the  Navier-Stokes  equations  of  hydrodynamics. 
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4.  Solution  of  the  integral  equation  (9).  To  find  the  solution  for  <f>  is 
the  hardest  part  of  the  theory.  To  give  you  some  idea  how  it  goes 
let  me  consider  again  the  quasi-Maxwell  model.  Since  the  right-hand 
side  of  (10)  and  also  /<0)  depend  only  on  the  thermal  velocity  (/,,  <f> 
will  also  depend  only  on  (/,.  Using  dimensionless  velocity  variables: 


n 

II 

M 

C 

Eq.  (9)  becomes: 

*  ZT  I  3  5 

(  3)  T»xaC T  2 

)  +  2Dae(wr)  (CaCa  “  5  8«c3)  =  naJ^) 

with: 

M)  »  jdc,  e-'2>  jdG  F(0)  [0'  +  0;  -  0  -  ^,1. 
The  conditions  (I  I)  are  equivalent  to: 


W-c2m  =  0. 


Eq.  (13)  is  especially  simple  for  the  Maxwell  model,  since  the  functions 
r,( 5/2  -  c *)  and  r,r,  -  (l/3)5„r2  which  occur  on  the  left-hand  side  of 
(13)  are  eigenfunctions  of  the  operator  In  the  notation  of 

Chapter  IV  they  are  </>um  (with  m  =0,  ±1)  and  0O am  (with  m  =  0, 
±1,  ±2).  Since  furthermore  I,  c,  ca  -  3/2  are  the  eigenfunctions 
0 ooo.  toim  and  0,oo  a  linear  combination  of  the  functions  4>um  and 
0o2m  will  also  fulfill  the  condition  (14).  Hence  one  easily  sees  that  an 
exact  solution  of  (13)  is: 


(14) 


h 


(15) 

0(c) 


— J —  i  —  c  (ca  - 
«<rA,,  Tdxa  °\  2/ 


where  the  eigenvalues  An  and  A02  can  be  found  from  the  general 
formula  (24)  of  Chapter  IV.  This  formula  gives: 


(16) 


^  Jrfflsin3  6F(6). 
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One  then  finds: 


nm 


=  ImlLi  2*ZY,a  bx, 

4  aAn  \  m  )  dxt 

Pi)  =  ?W)  =  PK  +  ^  jdc  e-'3<Hc )ctc, 

'  pS"  +  SC  "  3  °“S")' 


One  gets  therefore  the  Fourier  and  Newton  laws  (12),  and  one  obtains 
for  the  heat  conductivity  and  the  viscosity  coefficients  the  explicit 
expressions : 


07) 


5m*/2A7Y'2  = _ m  /2 kTV  2 

4  «rA, ,  \  m  )  ’  ^  2crA02  \  m  } 


which  are  both  positive  since  Au  and  A02  are  negative.  Both  are 
independent  of  the  density  and,  therefore,  functions  of  the  temperature 
alone.  The  independence  of  density  is  one  of  the  classical  results  of 
the  kinetic  theory  of  gases  and  is  in  accord  with  experiment.  The 
temperature  dependence  is  different  for  different  molecular  models. 
The  dependence  ~  T,  a  which  holds  according  to  (17)  for  the  quasi- 
Maxwell  model  (if  a  is  taken  independent  of  T),  is  also  valid  for  the 
elastic  sphere  model.5  It  only  agrees  qualitatively  with  experiment. 

From  (17)  one  still  finds: 


(18) 


5 

2  ci  P 


which  is  confirmed  very  well  for  mono-atomic  gases;  5/2  is  the  so- 
called  Eucken  constant. 

Also  for  other  molecular  models,  in  fact  for  any  differential  cross- 
section  l(g,  6)  it  is  possible  to  find  the  solution  of  the  integral  equation 
(9).  The  best  method  (first  suggested  by  Burnett)  is  to  develop  <f> 
always  in  the  eigenfunctions  ^r(m  of  the  Maxwell  model.  For  the 
coefficients  one  then  gets  an  infinite  set  of  linear  equations,  which  can 
be  solved  successively  and  at  least  for  repulsive  forces  inversely 
proportional  to  some  power  of  r,  the  convergence  of  this  procedure 
has  been  established.6 

It  should  be  emphasized  that  one  thus  obtains  k  and  p  expressed 
explicitly  in  terms  of  Kg,  0)  and  hence  in  terms  of  the  force  law.  One 
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can  therefore  predict  the  magnitude  and  temperature  dependence  of 
«  and  n  if  one  knows  the  intermolecular  forces  as  deduced,  say,  from 
the  second  virial  coefficient.  Hirschfelder  and  co-workers  have  made 
such  calculations  and  they  got  quite  a  good  check  with  experiment. 
1  think  there  is  no  doubt  that  one  has  found  in  this  way  the  right 
explanation  of  the  non-equilibrium  macroscopic  phenomena  of  internal 
friction  and  heat  conduction  of  a  gas. 

5.  Higher  approximations;  general  form  of  the  development.  The 

Chapman-Enskog  development  is  a  successive  approximation  or 
•perturbation  method  (with  the  local  Maxwell  distribution  (5)  as  the 
zeroth  order)  which  can  be  continued  in  a  systematic  way.  Since  the 
formulas  are  a  bit  long,  let  me  only  say  that  in  the  nth  order  one  gets 
for  the  deviation  <f>,n)  from  f'0)  an  inhomogeneous  integral  equation 
of  the  same  form  as  (9),  that  is: 

(19)  f'O'CW’")  =  LK 

where  Ln  depends  on  the  lower  order  deviatic  rjs.  The  solubility  con¬ 
ditions  are  (or  better,  the  method  »  so  arranged  that  they  are)  the 
hydrodynamical  equations  for  the  macroscopic  quantities  found  in 
the  previous  order,  just  as  the  solubility  conditions  for  (9),  that  is  for 

/,0W’)  =  Lx 

are  the  zero-order  or  Euler  hydrodynamical  equations.  Together 
with  an  expansion  of  the  distribution  function  of  the  form: 

(20)  /  =  /,0,(l  +  *“>  +  V2'  +•••) 

one  obtains  therefore  successive  order  hydrodynamical  equations: 
0th  order  Euler,  1st  order  Navier-Stokes,  2nd  order  Burnett  equations, 
and  so  on. 

What  is  the  expansion  parameter?  It  turns  out  that  the  expansion 
(20)  can  be  written  symbolically  as: 

/  =  /'°’ll  +  fl,(r)(AV)  +  u2(r)(AV)a  +  •  •  ] 

where  A  ~  (no)  ~ 1  is  the  mean  free  path,  and  the  gradient  acts  on  the 
macroscopic  variables  u(r,  r)  and  T( r,  t).  Compare  Eq.  (15)  for  the 
first  approximation.  In  the  second  approximation  one  gets  both 
second  derivatives  of  u  and  T and  squares  of  the  first  derivatives.  One 
can  therefore  say  that  the  expansion  parameter  is  the  relative  change  of 
the  macroscopic  variables  over  a  mean  free  path.  I  will  call  it  the  uni¬ 
formity  parameter  0.  Only  if  this  parameter  is  small,  that  is  if  the 
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u  and  T  vary  slowly,  will  the  expansion  show  a  semblance  of  con¬ 
vergence.  For  sound  waves  6  ~  \/A,  so  that  the  Chapman-Enskog 
development  can  be  expected  to  give  expansions  of  the  velocity  and 
absorption  coefficient  as  discussed  in  Chapter  V.  In  fact  if  one 
calculates  say  with  the  Maxwell  model  the  dispersion  law  of  sound  from 
the  successive  order  hydrodynamical  equations,  then  one  obtains  the 
same  expansion  as  found  by  the  Wang-Chang  method.  Clearly  for 
strong  shock  waves  where  6  ~  X/d  s  1,  d  =  thickness  of  the  shock, 
the  Chapman-Enskog  development  will  not  give  a  good  approximation. 

Finally  note  that  the  successive  order  hydrodynamical  equations 
are  always  of  the  first  order  in  the  time  derivatives  of  the  macroscopic 
variables,  but  are  successively  of  higher  order  in  the  space  derivatives. 
If  one  wants  to  apply  these  equations  to  the  flow  of  the  gas  around 
solid  bodies  or  between  solid  walls,  the  question  of  the  boundary 
conditions  has  to  be  answered.  More  and  more  conditions  are  clearly 
required  and  one  can  expect  therefore  successive  boundary  layer 
phenomena  similar  to  the  well-known  Prandtl  boundary  layer  theory 
which  is  so  characteristic  for  the  approximation  of  the  Navier-Stokes 
equations  by  the  (lower  order)  Euler  equations.  However  beyond  the 
Navier-Stokes  equations  this  question  has  not  been  investigated 
systematically,  which  is  one  of  the  reasons  why  the  number  of  applica¬ 
tions  of  the  Burnett  equations  has  been  very  limited.  Physically  it  is 
reasonable  and  customary  to  assume  that  the  molecules  which  hit  the 
wall  are  partially  specularly  reflected  and  that  the  remaining  ones  are 
re-emitted  by  the  wall  with  a  Maxwell  distribution  with  a  temperature 
and  average  velocity  determined  by  the  wall.  However,  it  is  not  clear 
how  one  can  express  this  picture  in  terms  of  boundary  conditions  on 
the  macroscopic  variables  which  at  every  stage  would  insure  an  unique 
solution  of  the  flow  problem. 

6.  The  character  of  the  Chapman-Enskog  development.  Let  me 
summarize  in  more  general  terms  what  we  have  found  so  far.  By 
means  of  a  successive  approximation  method,  we  have  obtained  a 
solution  of  the  Boltzmann  equation  of  the  form: 

(21)  /  =  /'°’(r,  vjn,  u,  T)  +  /»>(r,  «>|«,  u,  T)  +  •  •  • 

where  each  approximation  f{k)  is  a  function  of  r,  v  and  of  the  macro¬ 
scopic  variables  n,  u,  T  and  their  spatial  derivatives.  The  fik)  do  not 
depend  on  the  time  explicitly;  in  fact,  remember  that  at  any  stage  we 
eliminated  the  time  derivatives  by  means  of  the  corresponding  lower 
order  hydrodynamical  equations.  The  time  dependence  is  completely 
governed  by  the  time  dependence  of  the  macroscopic  variables  and 
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these  are  determined  by  the  hydrodynamical  equations  of  successive 
order  which  are  obtained  at  the  same  time  as  (21),  and  which  are  of 
the  form: 


dn 

di 


dx. 


(nu„). 


(22)  jj  =  V}u(r\n,  u,  T)  +  V?Xx\n,  u,  T)  +  ■  •  ■, 


^  =  e(1>(r|«,  u,  T)  +  9'3'(r\n,  u,  T)  +  •  •  •. 


Here  again  the  successive  approximations  V }k)  and  9lk)  in  the  equation 
of  motion  and  in  the  energy  equation  depend  on  n,  u,  T  and  their 
spatial  derivatives  but  they  do  not  depend  on  the  time  explicitly. 
Both  in  (21)  and  (22)  the  successive  orders  may  be  considered  to 
correspond  to  successive  powers  of  the  uniformity  parameter  0,  which 
is  of  the  order: 

(23)  0  s  X/L  i  t0/Tn 

where  A  =  mean  free  path;  t0  =  A /(kT/m)112  =  mean  free  time  or 
average  time  between  collisions;  L  s  0/grad  </>  with  <f>  any  macroscopic 
quantity ;  T0  2  L/  V0  -  macroscopic  relaxation  time  with  V0  =  sound 
velocity. 

The  curious  mathematical  feature  of  the  solution  (21)  of  the  Boltz¬ 
mann  equation  is  that  apparently  the  whole  temporal  development  of 
/is  determined  by  giving  the  initial  values  of  n,  u  and  T (which  accord¬ 
ing  to  (22)  determine  ail  later  values),  while  from  the  Boltzmann 
equation  itself  follows  that  one  needs  the  whole  initial  distribution 
/( r,  v,  0),  which  is  of  course  much  more  information  than  just  the  first 
five  moments  in  velocity,  /»( r,  0),  «(r,  0)  and  7T(r,  0).  Since  Hilbert 
pointed  out  this  feature  of  the  solution  (21),  I  usually  call  it  the 
Hilbert  paradox.7 

Physically  one  must  expect  that  an  initial  distribution  /( r,  v,  0)  in  a 
time  of  order  t0  relaxes  to  a  solution  of  the  form  (21),  whatever  the 
initial  distribution  is.  Or  one  can  say  that  after  a  time  of  order  t0 
(initial  chaotization  period)  a  contraction  of  the  description  of  the  state 
of  the  gas  is  possible,  in  which  the  temporal  development  will  be 
determined  by  much  fewer  variables.  And  the  reason  that  these 
variables  are  the  macroscopic  variables  n,  u  and  T,  is  clearly  because 
they  correspond  to  the  five  quantities  I,  v  and  v3  which  are  conserved 
in  a  collision.  Therefore  the  collisions  can  not  affect  the  n,  u  and  T 
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directly.  They  will  stay  constant  in  a  time  of  order  t0  and  in  this 
sense  one  can  say  that  they  are  constants  of  the  motion.  They  will 
change  with  time  only  “secularly"  through  the  change  of  the  distribu¬ 
tion  function  and  after  a  while  they  therefore  completely  govern  the 
temporal  development  of  the  state  of  the  gas. 

In  my  opinion,  the  idea  of  the  contraction  of  the  description  of  the 
temporal  development  of  the  system,  is  a  very  fundamental  insight. 
It  reminds  one  of  the  description  through  the  “macroscopic"  variables 
yu  y»,  ■  ■  ■  in  the  Boltzmann-Gibbs  picture  of  the  approach  to  equili¬ 
brium,  which  we  discussed  in  Chapter  I.  However  now  the  variables 
are  not  determined  more  or  less  arbitrarily  by  a  macroscopic  observer, 
but  are  determined  by  the  system  itself,  and  it  is  shown  that  they  form 
a  closed  deterministic  system  for  themselves,  that  is  to  say  that  their 
initial  values  determine  their  later  values. 

Of  course  this  contraction  of  the  description,  although  very  plausible, 
has  not  really  been  proved!  One  would  like  to  know  in  which  precise 
sense  the  solution  of  the  initial  value  problem  of  the  Boltzmann 
equation  is  approximated  by  the  Chapman-Enskog  expansion  (I), 
and  what  the  nature  of  the  expansion  is.  I  think  it  is  very  likely  that 
the  expansion  is  not  convergent  but  asymptotic  (as  suggested  by  the 
application  to  the  dispersion  of  sound),  and  that  the  approach  of  the 
/( r,  v,  /)  to  a  solution  of  type  ( 1 )  is  exponential  or  in  other  words  that 
the  difference  goes  to  zero  as  exp ( -r//0).  The  latter  is  indicated  by 
the  work  of  Grad  (Comm.  Pure  Appl.  Math.  2  (1949)  331).  Instead 
of  the  Chapman-Enskog  expansion.  Grad  has  proposed  an  approxima¬ 
tion  procedure  in  which  the  deviation  ^  from  the  local  Maxwell 
distribution  /“”  is  developed  in  the  appropriate  three  dimensional 
Hermite  polynomials  in  the  thermal  velocity  U.  The  coefficients 
(which  are  successive  moments  of  the  distribution  /)  are  functions  of 
r  and  /,  and  the  Boltzmann  equation  then  leads  to  partial  differential 
equations  in  these  coefficients.  It  is  now  quite  striking,  that  Grad 
could  show  that  all  moments  which  do  not  correspond  to  the  macro¬ 
scopic  variables  «,  u  and  T  (which  are  the  first  five  moments)  decay 
like  exp ( —  r//0)  to  the  values  which  would  follow  from  the  Chapman- 
Enskog  development.  I  think  it  confirms  and  substantiates  the  idea 
of  the  contraction  of  the  description. 


Notes  on  Chapter  VI 


1.  D.  Enskog,  Kinetische  Theorie  der  Vorgange  in  Mdssig  Ver- 
diinnten  Gasen,  Dissertation,  Uppsala,  1917. 

S.  Chapman,  Philos.  Trans.  Roy.  Soc.  London.  Ser.  A  216  (1916) 
279;  217(1917)  115. 

The  monograph  of  S.  Chapman  and  T.  G.  Cowling  ( The  Mathe¬ 
matical  Theory  of  Non-uniform  Gases,  Cambridge  University  Press, 
1939;  2nd  edition,  1955)  follows  the  method  of  Enskog.  We  refer 
to  this  book  for  further  details.  Compare  also  the  treatment  of  the 
theory  by  L.  Waldmann  in  the  Handbuch  der  Physik,  Vol.  XII, 
Section  E,  p.  384  and  in  the  book  of  J.  O.  Hirschfelder,  C.  F.  Curtiss 
and  R.  B.  Bird  ( Molecular  Theory  of  Gases  and  Liquids,  John  Wiley 
and  Sons,  New  York,  1954,  Chapters  7  and  8). 


2.  The  form  of  the  last  equation  in  (4)  is  perhaps  not  quite  familiar. 
Phenomenologically  it  can  be  derived  as  follows.  Consider  a  certain 
portion  of  the  gas  of  volume  Si,  then  the  work  done  per  second  by  the 
surrounding  gas  and  by  the  outside  forces  on  the  gas  inside  Si  is: 


(a) 


d<y„  P aflUg 


+ 


dr  pa„ua 


where  £  is  the  surface  of  Si  and  dol  is  a  surface  element  with  the 
normal  outwards.  Using  Gauss’  theorem,  the  equation  of  motion 
(Eq.  (4b))  and  the  symmetry  of  the  pressure  tensor  P„,  one  can  trans¬ 
form  (a) into: 


(b) 


J  j  j  dr  +  PUa 


Du*  1 
Dt  J 


This  must  be  equal  to  the  increase  per  second  of  the  kinetic  and 
thermal  energy  of  the  gas  inside  Si  plus  the  kinetic  and  thermal  energy 
which  flows  out  of  Si  per  second.  Let  e  =  (l/2)w*  +  (Qlp)  be  the 
total  energy  per  gram  and  let  s  =  pru  +  q  be  the  total  energy  flux 
density.  Then  one  has  therefore: 
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I 

I’ 


e 


HI*  [!<,»«)+ ^(p™.) +g-j 


+ 


jty«l 

8xa\ 


///*(' 


Dc 

Z>/ 


+ 


a?g\ 


using  the  continuity  equation.  Since: 


De 


P  a.  P^a  a. 


0/ 


_0 /(? 

Dt 


(?) 


this  gives  the  integral  form  of  Eq.  (4c). 


3.  Although  this  is  almost  self-evident  from  the  way  the  Euler 
equations  (7)  follow  through  (4)  from  the  general  transport  equation 
(3),  one  might  as  well  verify  it  directly.  The  orthogonality  of  the  left 
side  of  (9)  to  =  1,  leads  to: 


b 

bi 


+ 


a“  /  dy 


bf'0) 

<*l'a 


=  0. 


The  last  integral  is  zero  and  the  first  two  terms  clearly  give  the  con 
tinuity  equation: 


(c) 


bn  b  .  .  . 

_  +  =  o. 


The  orthogonality  condition  for  i>,  leads  to: 


Carrying  out  the  integrals,  one  obtains: 

b  b  b  tnkT\  „ 

<d>  a +  W.  +  If,  (-S’)  "  =  0 

and  this  becomes  the  equation  of  motion  (7b)  by  using  (c)  and  by 


! 
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putting  p  -  nkT.  We  leave  it  to  the  reader  to  show  that  the  ortho¬ 
gonality  condition  for  if>&  =  v*  leads  to  (7c). 

4.  By  differentiating  logarithmically  one. gets: 


/'•>  (ei  +  bx.  +  “■  bvj  f” 

*  7F™  (I  +  '■  w}"T  +  2TT>  t,*(§F  +  '*  w) 

,  m  .  .  ,  0Ka\  m  .  . 

+  kTUa  Ua)  \  bt  +  r>  dxj  kTa°(Va  Uo)' 

Now  from  (7): 


l(wr"32)=-M»^(wr'3,’>’ 

bT  =  _  67  _  2  dua 

bl  M“  3  e»x0’ 


<u,  _  ,bu,  I  bp 

bl  *  bx,,  1  p  bx, 


Substituting  these  expressions  for  the  time  derivatives  in  (e),  and  using: 


%  =  kT? 

f.v, 


+  kn 

bx,  bx, 


/i  fcn  3  dT 

t -</»7  32)  =  r3J^-  ,  nT6ay- 
fx,  ex,  2  bx, 


one  obtains  Eq.  (10)  of  the  text,  and  it  is  easy  to  verify  that  the  right- 
hand  side  of  (10)  is  orthogonal  to  I,  U,  and  IJ2  and  therefore  to  the  </>,. 

5.  See  Chapman  and  Cowling,  l.c.  Chapter  10,  where  one  finds 
the  explicit  expressions  of  and  *  for  various  molecular  models. 
For  the  real  Maxwell  model  p  and  *  are  proportional  to  the  first 
power  of  the  temperature.  That  we  found  for  the  quasi  Maxwell 
model  a  dependence  ~7‘'2  is  clearly  due  to  the  artificial  assumption 
that  the  dimensionless  function  F(8)  is  independent  of  the  temperature. 

6.  See  again  Chapman  and  Cowling,  l.c.  Chapters  7,  9  and  10,  and 
for  the  Lennard-Jones  (6-12)  potential,  Hirschfelder,  Curtiss  and  Bird, 
l.c.  Chapter  8,  §4.  The  papers  of  D.  Burnett  are  in  the  Proc.  London 
Math.  Soc.  39  (1935)  385  and  40  (1935)  382. 

In  first  approximation  for  any  differential  cross-section  l(g ,  8)  one 
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puts  <f>  equal  to  a  linear  combination  of  the  same  functions  and 
foam  occurring  in  the  solution  for  the  Maxwell  model.  Using  ordinary 
units  this  means  that  one  makes  the  Ansatz: 


(f)  *  =  fWu  U{aU*  ~  1)*‘  +  2aD*'(U»U'  ~  \  S«vt/3)xa 

where  a  =  mjlkT,  and  xu  Xa  are  constants.  This  leads  to  the  heat 
flux  density : 


qt  =  |  j dV  UtUaf'0' 


lUa 


with: 

(g) 


—  K 


8T 

8Xi 


K 


Xl- 


Similarly,  the  second  part  of  the  Ansatz  (f)  leads  to  the  viscosity 
coefficient : 


(h) 


rnn312  /a\312 

**  =  -  2^2  "W  ** 


Of  course,  the  Ansatz  (f)  can  now  only  be  an  approximate  solution 
of  the  Enskog  integral  equation  (see  Eqs.  (9)  and  (10)  of  the  text): 

tw  u-{aUa  ~  1) +  2aDu\u*u>  -  \  8«v^3)  = 

C(4>)  =  «^)3'a  JrfU,  e-ui  j diigHg,  0)W  +  -  *  -  *,) 

and  in  order  to  determine  the  Ansatz  (f)  completely  one  must  still  find 
the  quantities  x\  and  xa-  To  do  this  wc  require  that  they  are  the  first 
development  coefficients  in  the  development  in  the  eigenfunctions 
*l*rin  (f  =  1,  2-  •  •)  and  (r  =  0,  I,  2,  ■)  of  the  Maxwell  model. 

This  leads  immediately  to: 


jjUe-au3UuUu\ 

all2  - 

r 

J 

dUe-aV3Uu^U2  -  ? 
jdVe  aV3{uuU,  -  ^ 

K.V‘) 

(—a* 

(u.u,  -  58„vl/3) 

‘“j 

„v^3)< 

:{vuuy  -  1  sBV(/a) 
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The  calculation  of  the  integrals  is  straightforward  and  we  will  only 
indicate  some  of  the  details  for  xi •  The  numerator  is: 

(k)  4w  J*  dU  U*(aU>  -  ^ 

The  denominator  by  means  of  the  Boltzmann  //-theorem  transforma¬ 
tion  can  be  written  as: 

(!)  -  iw(;)3  2  f  jdUdUle-'"J*+vV  jdQgI(g,6) 

i\vkv‘  -  - 1)] 

where  the  d-symbol  means 

d(/(U))  =/(U')  +/(Ui)  -/(U)  -/(Ux). 

Clearly  the  terms  with  the  factor  5/2  in  the  d-symbols  in  (1)  can  be 
omitted  because  of  momentum  conservation.  Introduce  now  the 
center  of  mass  velocity  Uc  =  (l/2)(U  +  Ux)  =  (1/2)(U'  +  Ui),  then: 

U  =  Uc  +  ig,  U'  =  Uc  +  ig\ 

K.  Ux=Uc-ig', 

(/*+£/?  =  2 V\  +  ^  g\  dU  d\J1  -  dVe  dg, 
d((/,t/3)  =  (Ueg')g,'  -  (U.-gfr,. 

The  integral  over  Uc  can  then  be  carried  out  and  (I)  becomes  after 
integrating  over  the  directions  of  g: 

(m)  -  Jo  dg  g1  exp(  -  ?  g3)  j dil  ( 1  -  cos3  0)I(g,  0). 

This  is  as  far  as  one  can  carry  out  the  integral  without  making  special 
assumptions  about  /(g,  0).  Introducing  (k)  and  (m)  in  (i)  and  the 
resulting  value  for  *i  in  the  equation  (g)  for  the  heat  conductivity 
coefficient  gives  the  final  answer: 

K  =  UnmkTY"  -jrs - - - 

dG&e-°aQ(g) 


NOTES  ON  CHAPTER  VI 


117 


where: 


and  Q(g)  is  the  so-called  transport  cross-section: 

Q(g)  =  lit  J  dO  sin3  0  J(g,  0). 

The  calculation  of  x»  and  hence  of  p  goes  along  similar  lines,  and  one 
finds: 

h  =  a  {nmkTy2  — - - - 

Jo  dGCPe*  Q(g) 

so  that  the  relation  ( 1 8)  of  the  text  remains  valid  in  this  approximation. 

7.  Hilbert  in  his  paper  on  the  kinetic  theory  of  gases  ( Grundziige 
einer  AUgemeinen  Theorie  der  Linearen  Integralgleichungen,  Teubner 
Verlag,  Leipzig,  1912;  Chapter  22)  speaks  of  the  macroscopic  causality 
theorem  in  this  context.  In  our  opinion  the  word  theorem  is  con¬ 
fusing,  because  macroscopic  causality  (which  means  that  the  state  of 
the  gas  is  determined  by  giving  at  time  /  =  0  the  density,  velocity  and 
temperature  distribution)  is  clearly  a  requirement  which  one  imposes 
on  the  solution  of  the  Boltzmann  equation.  The  word  theorem 
suggests  also  that  the  macroscopic  causality  has  been  proved  by  the 
Chapman-Enskog  development,  which  is  clearly  not  the  case. 


Chapter  VII 


The  Kinetic  Theory  of  Dense  Gases 


1.  Introduction;  the  problem  of  deriving  the  Boltzmann  equation.  The 

recent  development  of  the  kinetic  theory  of  dense  systems  has  started 
from  the  question  how  to  derive  the  Boltzmann  equation  from  the 
basic  Liouville  theorem.  Since  the  subject  is  still  controversial,  let 
me  first  try  to  explain  why  one  would  want  to  derive  the  Boltzmann 
equation. 

It  is  of  course  true  that  the  way  Boltzmann  arrived  at  his  equation, 
which  we  presented  in  Chapter  IV,  was  heuristic  and  that  it  therefore 
invites  a  critical  analysis.  Historically  the  criticism  of  the  kinetic 
method  of  Boltzmann  was  mainly  directed  toward  the  //-theorem 
and  was  an  expression  of  the  feeling  of  discomfort  that  somehow  using 
reversible  mechanical  models,  Boltzmann  had  succeeded  in  explaining 
the  typical  irreversible  approach  to  equilibrium.  It  somehow  seemed 
a  swindle!  I  will  not  discuss  the  famous  classical  objections  of 
Loschmidt  and  Zermelo.  since  they  were  answered  by  the  statistical 
method  of  Boltzmann  and  the  Boltzmann-Gibbs  picture  which  evolved 
from  it.  But  it  leaves  completely  open  the  question  of  how  the  kinetic 
method  and  especially  the  Boltzmann  equation  fits  into  the  general 
picture  of  the  approach  to  equilibrium. 

Besides  the  logical  gap,  there  are  also  practical  reasons  why  one 
would  like  to  derive  the  Boltzmann  equation.  Clearly  the  applica¬ 
bility  of  the  Boltzmann  equation  is  limited  to  sufficiently  dilute  gases, 
so  that  only  binary  collisions  have  to  be  taken  into  account.  It  is  the 
restriction  to  binary  collisions  which  has  as  consequence  that  the 
transport  coefficients  x  and  n  are  independent  of  the  density.  Qualita¬ 
tively  this  comes  about  since  if  only  binary  collisions  are  important 
the  mean  free  path  A  is  inversely  proportional  to  the  number  density 
(i.e.  A  l/«<r),  and,  hence,  in  all  transport  processes  one  has  an  exact 
compensation  of  the  number  of  transporting  particles  ( ~  zr)  and  the 
amount  transported  by  each  of  them  (  ~  A,  hence  -  1  /«)  so  that  the 
result  is  independent  of  n.  If  triple  and  higher  order  collisions  must 
be  taken  into  account,  one  would  expect  that: 

its 
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(1) 


nail  +  C\tl  +  Cafl*  +  •  •  •) 


since  the  number  of  triple,  quadruple  -  -  -collisions  in  which  a  single 
given  particle  takes  part  will  clearly  be  ~/ta,  n3,  •  •  -.  The  compensa¬ 
tion  of  the  number  of  particles  transporting  and  the  amount  trans¬ 
ported  by  each  would  no  longer  be  exact,  and  one  would  expect 
virial  expansions  for  the  transport  coefficients  of  the  form : 


(2) 


M  =  Mo  (T)  +  Mi  (T)n  +  fta(T)na  +  •  •  • , 
*  =  *o (T)  -4-  *i(T)n  +  na(T)n a  +  ■  ■  ■ 


where  ho  and  *0  are  the  values  given  by  the  Chapman-Enskog  theory. 
In  addition  one  would  expect  that  at  higher  densities  even  for  mono- 
atomic  gases,  the  so-called  second  viscosity  coefficient  (or  bulk  vis¬ 
cosity)  would  appear  ( ~  /*2, 1  think,  since  it  will  depend  on  the  number 
of  quasi  binary  molecules).  There  is  considerable  experimental 
material  about  the  density  dependence  of  *  and  m  and  of  the  bulk 
viscosity  for  dense  gases,  and  at  present  one  is  still  completely  unable 
to  correlate  the  experiments  with  theory.  In  this  respect  the  contrast 
with  the  equilibrium  theory  of  non-ideal  gases  is  quite  striking. 
Clearly  the  expected  expansions  (2)  are  quite  similar  to  the  virial 
expansion  of  the  equation  of  state : 


(3) 


pv  _  -  .  b(T)  cm  ,  D{T) 

kT~  '  +  v  v *  +  r3 


In  fact  we  saw  that  B(T)  and  mo (T),  k0(T)  are  directly  related  to  each 
other  since  they  have  to  do  with  only  binary  interactions.  Similarly 
one  would  expect  that  the  third  virial  coefficient  C(T)  would  be  related 
to  mi (T)  and  x^T),  and  so  on.  But,  while  we  know  the  explicit 
expressions  for  C(T),  CUT),  -  in  terms  of  the  intermodular  forces, 
a  similar  expression  for  the  transport  virial  coefficients  is  not  known. 

To  come  back  to  the  Boltzmann  equation,  it  is  clear  that  to  develop 
a  theory  for  the  virial  expansion  of  the  transport  coefficients  one  must 
generalize  the  Boltzmann  equation  so  as  to  take  into  account  the  triple 
and  higher  collisions.  Unfortunately,  lacking  Boltzmann's  intuition, 
it  has  (at  least  till  now!)  not  been  evident  how  one  should  do  this,  and 
hence  one  has  been  forced  back  to  see  whether  one  could  “derive” 
the  ordinary  Boltzmann  equation  from  a  more  fundamental  point  of 
view,  with  the  hope  that  this  would  then  also  indicate  the  generaliza¬ 
tion  to  higher  densities. 

In  this  chapter  I  will  present  the  ideas  and  the  method  of  Bogoliubov. 
I  should  say  that  although  these  ideas  are  in  my  opinion  very  plausible 
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and  although  the  method  has  at  least  the  virtue  that  it  gives  a  definite 
recipe  how  to  generalize  the  Boltzmann  equation,  it  is  not  so  that  they 
have  led  to  definite  results,  which  are  physically  so  ev  <’ent  or  which 
are  so  completely  in  agreement  with  experiment,  that  there  could  not 
be  any  doubt  about  their  validity.  As  a  result  the  subject  is  still 
controversial.  Other  methods  to  derive  the  Boltzmann  equation  have 
been  proposed  by  Kirkwood  and  co-workers,  by  M.  S.  Green,  by 
Prigogine  and  co-workers,  and  by  a  number  of  other  authors.1  It 
seems  to  me  that  the  various  methods  begin  to  converge  towards  each 
other  and  perhaps  they  will  justify  the  ideas  of  Bogoliubov.  But 
further  clarification  is  certainly  needed! 

2.  The  B-B-G-K-Y  hierarchy.  Since  we  do  not  know  how  to  proceed 
by  intuitive  methods,  we  have  to  go  back  to  the  Liouville  equation, 
which  is  so  to  say  the  basic  equation  of  statistical  mechanics.  Con¬ 
sider  again  our  system  of  N  particles  with  known  interactions  in  a 
volume  V.  The  state  of  the  whole  gas  is  described  by  the  probability 
distribution  in  T-space  DN(xu  xa,  ■  ■  •,  jrw;  /),  where  jr,  s  (r,,  p,) 
denotes  the  coordinates  and  momenta  of  the  ith  particle.  We  take 
this  distribution  to  be  normalized  to  unity : 


(4) 


dx i  ■  ■  ■  dxN  Dn(x1,  ■  ■  ■ ,  xN,  t) 


1 


and  require,  for  a  gas  of  identical  particles,  that  DN  be  a  symmetric 
function  of  jr,,  x3,  ■  ■  ■ ,  xN.  The  change  of  DN  with  time  is  governed  by 
the  Liouville  equation : 


(5) 

with 


dPN 

at 


{Hs,  dn }  s 


v  leJkdJh< 

fa  \  dr,  dp, 


dJhdDjA 
dp,  dr,  f 


(6) 


Hn  = 


-  */!)• 


Here  U(r,)  is  the  wall  potential  (no  other  outside  forces  will  be  assumed) 
and  <K\r,  -  r,| )  is  the  interaction  potential  which  is  assumed  to  be  of 
the  usual  van  der  Waals  form  discussed  in  Chapter  I. 

Instead  of  using  Poisson  brackets  as  in  (5)  it  is  convenient  to  intro¬ 
duce  a  Hamiltonian  operator: 


xN(xr  -xN)  =  2  -  2  6 

i -i  m  dr,  fa 


a  -  8 

u  dr,  dp,  +  dr,  dp/ 


(7) 
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then  (5)  can  be  written: 

(8) 

Note  that  we  have  omitted  the  wall  potential  due  to  the  vessel  V,  so 
that  strictly  (8)  is  only  valid  if  the  particles  are  inside  V.  We  will 
soon  go  to  the  limit  N  -*■  oo,  V  -»  oo,  v  =  V/N  finite  and  then  the 
effect  of  the  walls  will  drop  out  completely  so  that  (8)  can  be  used. 
From  (5)  or  (8)  follows  by  integration  a  hierarchy  of  equations  for  the 
partial  distribution  functions: 

(9)  F,(x c„  •,  *,)  =  V  J-  •  •  J Dn  dx„  i-  •  dxN 

which  has  been  derived  by  many  authors  and  which  I  call  therefore 
the  B(ogoliubov)-B(orn)-G(reen)*K(irkwood)-Y(von)  hierarchy.®  One 
finds: 


(10)  +  X ,F,  «  1  jdx,tl  ^  +  ,F, * 

Proof.  Integrating  (8)  over  x,  +  l  **  and  multiplying  with  V\ 
one  obtains: 


+  3T,F,=  V'j  -  jdx,tl 


+  2  +  2  f^*(*i»  ■  ■  ■*  xn*  *)• 

1  S *:  ■  +  1  S  /  S  V  I+1SKI  j 


Since  DN  must  be  assumed  to  vanish  for  large  p,  and  r,  the  first  and 
third  terms  in  the  brackets  integrate  to  zero.  The  second  term, 
because  of  the  symmetry  of  DN  can  be  written  in  the  form : 


V(N  -  s)  ^ 

j  dx,t  [  0),,  +  i 

• 

dx,,2 

dxN  Ds 

1-1 

J  J 

N  -  s  ± 

V  ^ 
r  1-1  * 

jdx,.i 

♦  1^*  ♦  iC*i*  * 

’  '  »  *14.  It 

t). 


Hence  in  the  limit  N  ->  oo,  V  ->  co,  v  =  V/N  and  s  fixed  one  obtains 
the  right-hand  side  of  (10). 
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Since  we  shall  be  especially  interested  in  the  cases  s  =  1  and  s  -  2, 
let  me  write  out  these  cases  more  explicitly.  One  gets: 

(8)  TF  +  m  If  =  l  9MXl' 

nni 

dx3(0l3  +  033)F3(Xi,  x3,  x3,  t). 

Note  that  (11a)  already  smells  a  bit  like  the  Boltzmann  equation. 
But  of  course  the  connection  is  still  far  from  clear. 


(b) 


dh  +  /Ei.  JL  +  Ee.  JL 
dt  \m  brx  m  tr3 

-li 


3.  The  ideas  of  Bogoliubov.  The  Liouville  equation  (3)  or  the  hier¬ 
archy  (10)  to  which  it  is  equivalent  embodies  the  mechanical  assump¬ 
tions.  Knowing  DN( 0)  at  t  =  0,  (3)  will  tell  us  in  principle  what  DN 
is  at  any  lat> :  :me.  B  ;t  how  does  one  know  Ds( 0)?  The  answer 
is,  I  think,  that  >ne  is  i  nly  interested  in  those  phenomena  which  are 
independent  of  Ds( 0),  or  rather  depend  only  on  a  few  average  values 
over  Dn( 0).  To  see  how  such  a  description,  essentially  independent 
of  Ds(0),  can  come  about  we  have  first  to  look  at  the  basic  relaxation 
times.  They  are: 

(a)  the  time  of  a  collision  r  ~  r0/i\v ,  r0  =  range  of  the  forces  (which 
from  now  on  we  will  assume  to  be  purely  repulsive)  and  t>»v  is  some 
average  thermal  velocity. 

(b)  the  time  between  collisions.  t0  =  A/rlv;  A  =  mean  free  path. 

(c)  the  macroscopic  relaxation  time.  To  =  LjV3\  L  is  a  macro¬ 
scopic  length  and  V0  is  the  sound  velocity  ( 2  rav). 

In  the  usual  situations:  slowly  varying  macroscopic  phenomena,  not 
too  dense  gases  (therefore  not  for  liquids  for  which  r0  2  A),  one  will 
have: 

(12)  r  t0<^  T0. 

Now  Bogoliubov  says  that  one  can  expect  that  after  an  initial 
chaotization  time  of  order  r,  and  for  any  DN( 0)  a  stage  is  reached — 
Bogoliubov  calls  it  the  kinetic  stage — in  which  the  further  temporal 
development  of  the  gas  is  determined  completely  by  the  temporal 
change  of  the  first  distribution  function  Ffx,  t)  which  in  turn  is 
governed  by  an  equation  of  the  form: 


(13) 


=  A(x\Fx) 
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where  A  depends  functionally  on  Fx  but  does  not  depend  on  the  time 
explicitly.  All  the  higher  distribution  functions  F,  depend  on  the 
time  only  through  Fx  and  have  therefore  the  form: 

04)  F,  =  F.(xx,  •  •  •,  jcJFJ  (j>2). 

To  elucidate,  note: 

(a)  Eqs.  (13)  and  (14)  represent  a  contraction  of  the  description  of 
the  state  of  the  gas,  which  is  quite  analogous  to  the  Chapman-Enskog 
solution  of  the  Boltzmann  equation  which  describes  the  second  or 
hydrodynamical  stage  of  the  total  relaxation  process.  Eq.  (13)  is 
analogous  to  the  Ansatz  for  the  hydrodynamical  equations  (Eq.  (22), 
Chapter  VI),  and  Eq.  (14)  to  the  Ansatz  for  the  solution  of  the  Boltz¬ 
mann  equation  (Eq.  (21),  Chapter  VI)  which  we  discussed  in  the 
previous  chapter. 

(b)  Just  as  in  the  hydrodynamical  stage  the  contraction  is  coupled 
with  an  expansion  in  the  uniformity  parameter  0  ~  t0/T9 ,  in  the 
kinetic  stage  the  contraction  is  coupled  with  an  expansion  in  powers  of 
\/v  (virial  expansion).  Bogoliubov  puts: 

(15)  /t(jc|F,)  =  /4(0)(x|F,)  +  I  A“\x\Fx)  +  ~  A«\x\Fx)  +  •  •  •, 
and 

(16)  F,  =  F,°\xx ■  ■  *.| Fx)  +  i  F,"(xx  x,\Fx)  +  •  •. 

Note  that  the  dimensionless  parameter  in  (15)  and  (16)  will  of  course 
be  of  order  rl/v  =  r%n  =  rj A  since  A  ~  l/wg.  Hence  just  as  in  the 
hydrodynamical  stage,  the  development  parameter  can  be  looked  upon 
as  the  ratio  r//0  of  the  two  relaxation  times  in  question.  However  in 
practice  it  is  easier  to  use  \/v. 

(c)  The  reason  that  in  the  kinetic  stage  F,  is  the  basic  “secular” 
variable  which  governs  the  temporal  development,  is  because  of  the 
fact  that  the  intermolecular  forces  do  not  affect  F,  directly.  Only  for 

2  does  the  streaming  term  X,F,  contain  the  intermolecular  force 
so  that  F,  (s  5  2)  will  change  quickly  in  a  time  of  order  r  while  the  Fx 
will  stay  constant.  In  the  same  sense  therefore  as  in  the  hydro¬ 
dynamical  stage  and  in  the  time  scale  t0  the  macroscopic  quantities 
P ,  u  and  T  could  be  considered  constants  of  the  motion,  so  in  the 
kinetic  stage  and  in  the  time  scale  r  the  F,  can  be  considered  to  be  a 
constant  of  the  motion. 

(d)  The  connection  with  the  general  Boltzmann-Gibbs  picture  of 
the  approach  to  equilibrium  is  I  think  clear.  The  contraction  is 
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similar  to  the  description  through  the  “macroscopic”  variables 
yu  >a.  •  •  •  except  that  again  the  choice  of  Fi  is  determined  by  the 
system  and  not  by  a  macroscopic  observer,  and  that  one  obtains  at 
the  same  time  a  closed  deterministic  description  through  the  kinetic 
equation  (13). 

Of  course,  all  this  is  a  program.  We  will  now  proceed  to  work  it 
out,  and  then  you  will  see  also  some  of  the  difficulties. 

4.  Determination  of  the  functionals  F,tl\  Substituting  the  virial  expan* 
sion  (16)  for  s  -  2  in  the  first  equation  of  the  BBGKY  hierarchy  (10) 
one  obtains: 

+  p  jdxa  9iaFau(x fi,  -Yal^i)  +  •  •  •• 

Hence  comparing  with  the  kinetic  equation  (13)  and  using  the  virial 
expansion  (15)  for  A(x\Fx),  one  gets: 

(17a)  ^’Or.lF.)  =  -  =  -XxFl 

m  OTj 

(17b)  /ftv.l^)  =  jdx2  BnFMxu 

and  in  general : 

(17c)  ^'’(.t.lF.)  =  jdx a  etaF^-l\Xl,  x2\ F,). 

Since  for  s  ^  2,  F,  depends  on  time  only  through  its  functional  depen¬ 
dence  upon  Fu  its  time  derivative  can  be  expressed  in  terms  of  F% 
through  the  kinetic  equation  (13).  Indeed,  for  any  functional 
*Hx i,  xa  ■  ■  |F|)  which  depends  on  time  only  through  Fu  ty/dt  can  be 
expressed  as  a  new  functional  of  Fi  by  forming  the  formal  derivative  of 
4>  with  respect  to  t  and  then  replacing  dFJdt  by  A(x\Fx)  wherever  it 
occurs  in  the  resulting  functional.  If  in  addition  we  introduce  the 
virial  expansion  (15)  for  A(x\Fl),  it  is  clear  that  one  can  write: 

(18)  {jiHxux2---Fl)  =  Dt0,</i  +  I  +... 

where  the  formal  operator  DH)  consists  in  forming  the  derivative  of  ^ 
with  respect  to  t  and  then  replacing  dFJdt  wherever  it  occurs  by 
/^'’(jcIFi).  Of  course,  the  argument  x  of  Ain(x |F,)  must  be  taken  as 


DETERMINATION  OF  THE  FUNCTIONALS  Fi°  125 


the  argument  of  dFi/dt  as  it  occurs  in  the  differentiation  process. 
Hence,  substituting  the  virial  expansion  (16)  for  F,  into  the  hierarchy 
(10)  for  s  >  2  and  equating  equal  powers  of  \/v  one  gets: 

(19a)  D,0)F}0)  +  X  fF<°>  =  0 

and  for  /  £  1 : 

(19b)  DmF«'  +  X ,Fl"  =  -  f  +  f dx,+i  f 

l-l  J  1-1 

These  equations  together  with  (17)  are  functional  equations  for  F,m 
assuming  that  Fx  is  a  given  function,  which  is  in  principle  arbitrary. 
Presumably  (19a)  will  determine  Fl0)  and  therefore  especially  F£°\ 
which  then  according  to  (17b)  enables  one  to  find  Aa>.  This  in  turn 
allows  one  to  find  D<UF!0>  and  then  (19b)  for  /  =  1  will  give  F,(l) 
which  leads  to  A,3\  and  so  on. 

We  now  have  to  discuss  in  which  sense  the  functional  equations  (19) 
determine  the  functionals  F},}.  We  will  assume  therefore  that  the 
right-hand  sides  of  the  equations  (19)  are  known  functionals  of  Fx  and 
we  will  try  to  find  F!l). 

The  trick  Bogoiiubov  uses  to  solve  a  functional  equation  of  the 
form: 

(20)  D'O'F',1'  +  X,Fl"  =  ft"(* ,  -  ■  xjF,) 

where  is  a  known  functional  of  Fu  is  the  replacement  of  the  formal 
operator  D,0>  by  an  ordinary  derivative  with  respect  to  an  auxiliary 
time  variable  r.  To  explain  this,  we  introduce  the  ^-particle  streaming 
operator : 

(21)  Sfc(jr,  xk,  r)  =  exp{rXk(xr  ■  •.*■*)}. 

It  is  the  time  displacement  operator  over  the  time  r  due  to  the 
streaming  of  the  k  particles  in  their  6£-dimensional  phase  space  under 
the  influence  of  their  mutual  interactions.  If  (r,,  p,)  is  the  phase  of 
the  ith  particle,  then  in  the  motion  of  the  k  particles  the  phase  of 
particle  /  a  time  r  later  is  5v(r)r,,  S^rjp,,  which  is  a  function  of  the 
phases  xx  -  ■  xk  of  all  k  particles,  although  usually  we  will  not  put 
this  in  evidence.  If  x(*i  •  ••  xk)  is  an  arbitrary  function  of  the  phases 
of  the  k  particles  then : 

(22)  ^(rJxfATi  ••  **)  =  x(S*(T)*i>  5k(r)jf2  •  Sk{r)xk), 

Clearly  the  Sk(r )  form  an  additive  Abelian  one  parameter  group  of 
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operators;  5k(ri)-5k(ra)  =  Sk(r,  +  ra);  Sk(  - r)  is  the  inverse  of 
5k(r)  and  Sk(0)  =  I.  Especially: 

(23)  S,( r)x(x,  t)  =  e'*>x(X'  I )  =  x(r  +  J  r,  p,  /)• 

To  replace  the  formal  operator  f><0)  by  an  ordinary  derivative  we  first 
note  that : 


£  SMF^x,  t)  =  Xle'*>Ft  =  X.SMF, 

(24) 

=  -^,0,(jr|51(r)Fl) 

using  the  expression  (17a)  for  A(0\  Hence  from  the  definition  of  the 
operator  D<0>  it  follows  that: 

(25)  D«"F“'(Xl  x.lSMF,)  =  -  I-  Fl"(Xl  ■  •x,|S1(r )F1). 


Since  the  function  FAx,  t)  in  (20)  is  arbitrary,  we  can  replace  it  by 
Si(r)Fi  and  using  (25)  it  follows  that  (20)  is  equivalent  to  the  differ¬ 
ential  equation: 


(r)F,) 

’Or 

(26) 


=  -#'>(*!  •■•*.|Sl(r)F1). 


So  far  nothing  has  been  assumed;  one  has  only  transformed  (20)  into 
(26)  because  it  is  simpler  to  treat  ordinary  differential  equations  than 
functional  differential  equations.  However  in  the  form  (26)  one  sees 
that  more  information  is  needed  to  find  the  F,‘\  Clearly  some  sort 
of  initial  or  boundary  condition  is  required.  Bogoliubov  assumes  that 
for  any  of  the  functionals  F,  for  i  ^  2: 


(27) 


Lim  S,(  -  r)F.(.v,  •  x>\ 5,(r)F,) 


=  Lim  S,(-t)  PI  SMFdx,,  t). 

T-®  1 


The  basic  nature  of  this  assumption  should  be  emphasized.  What  it 
means  is  most  easily  seen  by  considering  the  spatial  uniform  case, 
since  then  (see  Eq.  (23))  the  5t(r)  operator  can  be  omitted  and  (27) 
becomes : 

s 

Lim  S,(~r)F,  =  Lim  5,(-r)  f  \  F,(p„  t). 

t  -  «  t  «  |s  ] 

We  can  say  therefore  that  if  one  traces  the  s  particles  back  in  the  past 
where  they  will  be  far  apart  from  each  other  because  of  the  repulsive 
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forces,  we  assume  that  their  momenta  are  uncorrelated  so  that  the  F, 
is  a  product  of  /Vs.  This  is  not  obvious!  One  might  for  example 
just  as  well  think  that  the  same  type  of  independence  would  hold  for 
the  future,  that  is  for  r  -»  -  oo.  We  shall  soon  see  that  this  is  not 
acceptable  since  it  would  lead  to  a  Boltzmann  equation  for  which  the 
direct  and  restituting  collisions  are  interchanged  so  that  the  //-function 
would  increase  instead  of  decrease.  It  is  the  boundary  condition  (27) 
which  selects  the  appropriate  irreversible  solutions  of  the  BBGKY 
hierarchy  (10)  of  the  form  (13)  and  (14).  The  reason  why  such  a 
condition  is  necessary  while  in  the  corresponding  Chapman-Enskog 
development  there  was  no  need  for  an  analogous  condition,  is,  I  think, 
because  the  Boltzmann  equation  is  already  irreversible  while  for  the 
Liouville  equation  (and  therefore  for  the  hierarchy)  one  must  choose 
the  direction  of  time  for  which  the  continuing  phase  mixing  takes 
place.3 

Accepting  the  boundary  condition  (27),  let  us  see  how  the  functionals 
F',h  can  then  be  determined.  First  of  all  from  the  virial  expansion 
(16)  of  the  F,  it  follows  that  (27)  implies: 


f(a) 


(28)  < 


(b) 


Lim  S,(-t)/',,0,(a’1-  •  x,\Si{r)Fi) 

t  •*  m 

=  Lim  S,(  —  r)  S,(r)/'1(jrl,  /), 

I  •  *  |  _  1 

Lim  S.(-r)/r»(jc,  v.lS.MF,)  =  0  (/  *  0). 

t  -  to 


Start  from  (26)  and  take  first  /  =  0,  in  which  case  the  right-hand  side 
=  0  (see  Eq.  (19a)).  The  “solution”  of  (26)  is  then: 


/'.,0,(-v,  •  •  *.\SAt)Fi)  =  S,(t)F:0'(Xi  •  -jc.| Ft). 

Or: 

Ft0>(X\  x,\F})  =  S.( -t)/7°’(.v,  •  -x,\Si(t)F ,). 

The  left-hand  side  is  independent  of  r,  so  one  can  go  to  the  limit 
r  -*•  oo  and  from  the  boundary  condition  (28a)  one  then  finds: 


(29)  /■.,0,(-v1  -jr,| /',)  =  Lim  5,(  -  r)  f]  SMFJx,,  t). 

»-«>  i=  i 

To  find  F',"  for  /  >  0  from  Eq.  (26)  one  argues  in  the  same  way. 
From  the  general  formal  “solution”  of  (26)  one  gets: 

FH*.  -Jr.lF.)  =  S,(-r)F«''(.vr  •  •jr.|S,(T)FI) 

+  f  dr,  S.i-rMWx,-  ■  X.lSArJF,) 

Jo 


128  THE  KINETIC  THEORY  OF  DENSE  GASES 


and  by  applying  the  boundary  condition  (28b)  for  r  -*■  oo  one  sees  that 
the  first  term  on  the  right-hand  side  is  zero,  so  that : 

(30)  Fi'Xx, ■  ■  -x.\ f,)  =  Jo"  dr,  SA-uMXxi ■  ■  •Jti|S,(r1)F|). 

Using  (29)  and  (30)  the  functionals  Fin  can  be  determined  successively 
from  the  basic  equation  (19). 

5.  The  kinetic  equation.  Let  us  now  write  out  according  to  our 
recipe  the  first  few  terms  in  the  kinetic  equation.  To  simplify  the 
formula,  let  me  introduce  the  abbreviation: 

(31)  ■'/’,(x1  x,)  =  Lim  S,(j r,  ■  •  •  x„  -r)  f]  S,(a„  t) 

»-•  i-i 

since  this  combination  of  streaming  operators  occurs  all  the  time. 
From  (29)  one  gets: 

Fa°\x,xa\F i)  =  ffa(xx xa)F,(xu  t)F,(xa,  l). 

Hence  substituting  in  ( 1 7b),  one  can  write : 

^ll>(Aj|Fi)  =  f dxa  &,a y  3F i(Aj,  t)F i(jc2,  I) 

(32) 

=  j  j  dc,  di2  f,u„  /)F,(i2.  tmXl  |{lf  c2) 

with 

(33)  Q3(xi  Kj*  Ca)  =  j dxa  0 ia'fa{x,,  xa)&(x,  —  {,)S(jra  —  £2). 

Here  (,  =  (§,,  tj,)  corresponding  to  =  (r„  p,).  Clearly  the  two 
lines  in  (32)  are  equivalent  with  the  definition  (33)  for  Qa.  The 
reason  for  writing  An>  in  the  form  (32)  is  in  order  to  separate  the 
dependence  on  the  distribution  F,  from  the  mechanics  of  the  binary 
interaction  which  is  embodied  in  the  function  Qa. 

Turning  now  to  Ali)(x,\F,),  we  have  first  to  find  Fa\  which  means 
according  to  (30)  that  we  have  to  determine  From  (29)  and  the 
definition  of  the  operator  Dn\  one  gets,  using  the  first  line  of  (32) 
for  A"’: 

D"'Fl°'(x„xa\Ft)  =  ya{FAxl,t)A'''(xa\Fl)  +  A^x^F^x,,  l)} 

=  //2(l,  2)  J dx3  {»13.'/'2(l,  3)  +  023.f/2(2.  3)}  jQ  F,(xh  0 
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where  we  have  omitted  the  jr’s  in  the  arguments  of  the  streaming 
operators  and  only  indicated  the  particles  involved.  Next  according 
to  (19b)  we  need  F^0)  which  according  to  (29)  is  given  by: 

FfXxu  xa,  xa\Fy)  =  AO.  2.  3)  n  fi(*.  ')• 

I-  1 


Altogether  one  gets : 

M'(xu  xa\F>)  =  f dx 3  [(©l3  +  ea3).r3(l,  2,  3) 

(34) 

-  AO.  2){613  A(1.  3)  +  »23  A(2.  3)}]  fl  Fi(xt,  /)• 

1*1 

Putting  this  in  (30)  (taking  /  =  I,  s  =  2)  gives  Fll\ xu  JCalF,)  and  from 
(17)  one  then  finds  A <2>.  Note  that  i//a,>  and  therefore  also  Fal>  is 
symmetric  in  xit  xa.  Separating  as  in  (32)  the  distribution  functions 
Fi  from  the  dynamics  of  the  triple  interactions,  one  can  finally  write: 

(35)  A«\ xi  IF,)  =  J  J  jdii  dla  dl3  J1  OBAxMu  ia,  C3) 

with: 


fl3(*,l£„  £a.  £s) 

dr  Sa(l.  2,  -r)  f dx3  f  ]  f|  S,(x„  t)8(x,  -  {,). 

J  i«  1 

where  (  ]  stands  for  the  expression  in  square  brackets  in  Eq.  (34). 

I  think  this  shows  sufficiently  how  the  successive  approximation 
method  works,  and  how  one  can  determine  successively  the  functionals 
F,<n  and  A<k).  Especially  one  will  get  for  the  kinetic  equation: 

TT  +  m  Tif  =  l  J /*•  dU  £a)f*££>-  0Fi((a.  /) 

(37)  +  i  f23(x,|£,,  C2.  Ca)  I~I  F,((|,  /)  +  • 

which  shows  explicitly  the  successive  effects  of  the  binary,  ternary  •  •  • 
interactions. 

I  will  now  leave  the  formal  machinery,  and  will  try  to  discuss 
whether  the  results  make  sense! 


6.  Discussion  of  the  kinetic  equation.  There  are  a  number  of  require* 
ments  which  one  would  expect  that  the  kinetic  equation  should  fulfill. 
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(a)  In  the  first  place  one  would  expect  that  the  first  term  on  the 
right-hand  side  of  (37)  is  equivalent  to  the  Boltzmann  collision  term. 
I  will  now  show,  following  Bogoliubov,  that  this  is  exactly  the  case  for 
spatially  uniform  systems. 

Proof.  Since  in  the  spatially  uniform  case  Fx  depends  only  on 
p  and  t  one  can  omit  in  '/'2  the  Sx  operators,  and  since  S2(-t)  acting 
on  p,  or  p2  only  involves  the  relative  coordinate  r12  =  r2  -  ri,  one 
easily  sees  from  the  first  line  in  (32)  that  A'1'  becomes: 

(38)  /f",(p,|f1)  =  jdrX2jdp2el2Fl(Plxa\t)Fl(P?\t) 
where : 

F,2)  =  Lim  S2(.v,.  x2,  -r)p„  (i  =  1,  2) 

1  —  V, 

are  the  constant  initial  momenta  in  the  binary  collision  governed  by 
the  Hamiltonian  H2  which  leads  to  the  phase  xx,  x2  at  time  zero.  The 
F,2)  are  functions  of  pi,  p2  and  r,2.  Since  in  terms  of  the  momenta 
F,2’  the  Hamiltonian 

Hl  =  i(p?  +  pi)  +  +'2  =  2L<(P,2,)2  +  (P22>)2> 

it  follows  that : 

(39)  \H2,  F|(F,2’,  t)Fl(P'2\  /)}  =0. 

This  is  clearly  so  if  H2  is  expressed  in  the  F,2>,  but  then  it  also  must 
be  so  if  H2  is  expressed  in  the  original  variables  xx,  x2,  since  the 
transition  from  jr,,  x2  to  the  F,2’  is  a  contact  transformation  and  the 
Poisson  bracket  is  invariant  under  such  transformations.  Written 
out  in  the  variables  *i,  x2,  Eq.  (39)  implies: 

t)Fx( F22\  i)  =  F,(F,2>,  t)Fx(V*£\  t). 

m  c’rx2 

Putting  this  in  (38),  and  using  in  the  r,2-integral  cylindrical  coordinates 
with  the  axis  in  the  direction  of  the  relative  velocity  g  =  (p3  -  Pi )/m, 
Eq.  (38)  becomes: 

(40)  A" 1  >(p, | Fx)  -  J dp2  £’  gb  db  dr  J  *  *  dl  |  Fi(F,a>,  i)Fx(P?\  t ) 

where  /  is  the  coordinate  along  the  cylindrical  axis  and  ( b ,  e)  are  the 
polar  coordinates  perpendicular  to  this  axis.  Clearly  the  /-integral 
can  be  performed.  Since  at  /  ►  -  oo  the  two  particles  are  outside 
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their  action  sphere  and  since  the  Sa(-r)  will  never  bring  them  to¬ 
gether: 

Lim  Pi3)  =  p,.  (/=1,2). 

I-  -  *> 

On  the  other  hand  for  /  ->  +  qo,  although  the  particles  are  then  also 
outside  their  range  of  interaction,  the  Sa(xuxa,  -r)  operation  for 
t  -*  oo  will  certainly  produce  a  collision  and  therefore: 

Lim  P‘,2*  =  p|,  0=1,2) 

I  -  +  r> 

where  the  pj  are  the  momenta  of  the  restituting  collision  (pi,  pa)  -*• 
(Pi,  pa).  Introducing  again  the  differential  cross-section  Kg,  6)  by: 

hdbdr  =  Kg,0)dQ 

(40)  becomes: 

/^’(PilF,)  =  jdp2  jdQgKg,  0)[F,(p'„  /)F,(pi.  I)  -  Fj( p„  t)F ,(pa>  /)] 

which  is  the  Boltzmann  collision  operator. 

For  systems  which  are  not  spatially  uniform  the  proof  breaks  down 
and  in  this  case  the  Boltzmann  form  of  the  collision  operator  can  only 
be  an  approximation  of  the  /4a,(jc|Fi).  In  a  sense  this  is  satisfactory 
since  in  the  intuitive  derivation  of  the  Boltzmann  equation  it  is  clear 
that  some  uniformity  assumptions  were  really  implied.  For  instance 
the  difference  in  position  of  the  colliding  particles  was  neglected  and 
it  was  assumed  that  the  changes  of  F,  in  time  due  to  the  streaming 
and  due  to  the  collisions  are  strictly  additive  and  do  not  influence 
each  other.  Clearly  both  these  assumptions  can  only  be  valid  if  the 
spatial  variation  of  Fx  is  small,  and  it  is  therefore  satisfactory  that  the 
Bogoliubov  theory  gives  a  more  general  form  of  the  binary  collision 
operator. 

(b)  The  collision  terms  A"\  A'2'-  •  •  must  have  the  cluster-property, 
which  means  that  they  must  be  zero  if  the  particles  do  not  make  a 
binary,  triple  -  ■  collision.  For  A'l)  this  is  obvious,  because  if  the 
two  particles  stay  outside  their  action  sphere  then  the  operator  @,a 
will  make  Qa  and  therefore  A'"  zero  (see  Eq.  (33)).  For  A'2)  it  can 
be  verified  as  follows.  From  Eq.  (36)  for  one  sees  that  the  inte¬ 
grand  of  the  or a-integral  will  vanish  if  |r2  —  r,  |  >  r0  because  of  the 
@12  operator.  Consequently  one  needs  to  consider  only  those  xa  for 
which  |ra  -  r,|  <  r0.  For  this  phase  U,,  xa)  the  Sa(l,  2,  -r)  opera¬ 
tor  separates  the  particles  I  and  2  by  a  distance  r„  in  a  time  of  the 
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order  of  the  collision  time  and  from  that  time  on  the  distance  of  the 
particles  increases.  Put : 

X,  =  S2(l,2,  -t)x„  (i  =  1,2) 

and  call  r  the  time  for  which  the  distance  |r3  -  ra  |  has  become  larger 
than  2r0.  The  operator  occurring  in  the  r-integral  in  (36)  can  now 
be  written  as: 

)  dx3  [{013(ii,  Jf3)  +  Oa3(xa,  x3)}  ¥3(Xu  X2,  x3) 

(4I) 

—  ^a){®j3(^i,  x3)  r/'2(xlt  x3)  +  ©a3(xa,  x3).</> a(£a,  jc3)}]. 

For  r  >  t',  0,2(jE,,  Jfa)  =  0  and  for  fixed  x3  either  ©t3(j?t,  or3)  or 
®2a(J?a.  x3)  must  be  zero,  since  J?i  and  Jfa  are  separated  by  a  distance 
bigger  than  2r0.  Suppose  Sl3(xt,  x3)  =  0,  but  8a3(j?a,  *3)  /  0. 
From  the  definition  of  ■'/’3,  and  since  for  this  configuration  X3  = 
X2(X2%  x3)  +  Tf  it  is  clear  that  one  then  has: 

■'’■AXu  x2,  x3)  =  2(x2,  x3) 

and  since  in  this  case  ‘/’a(£),  x2)  =  1  one  sees  that  the  operator  (41) 
goes  to  zero.  This  is  also  the  case  if  ©a3(jta,  jc3)  =  Obut  ©i3(jfi,  x3)  #  0 
by  a  similar  argument,  and  hence  one  can  conclude  that  Q3  is  only 
different  from  zero  if  for  a  time  of  the  order  of  the  collision  time  the 
three  particles  are  in  a  connected  configuration. 

It  would  be  very  valuable  if  one  could  further  “understand”  the 
expression  for  Si3,  in  the  same  way  as  one  “understands”  the  Boltz¬ 
mann  form  of  the  binary  collision  operator.  For  instance  one  might 
hope  that  in  the  spatially  uniform  case  one  could  transform  Si3  in  a 
similar  way  as  ii2,  so  that  one  would  see  that  f?3  depends  only  on  the 
momenta  of  the  three  particles  before  and  after  the  triple  collision. 
We  have  not  succeeded  in  doing  this  and  I  doubt  whether  it  is  possible. 
However,  I  still  think  that  it  should  be  possible  to  see  so  clearly  through 
the  structure  of  the  successive  collision  kernels  ii2,  Q3  ■  ■  ■  that  one 
could  imagine  that  with  enough  insight  one  would  have  written  them 
down  intuitively  just  as  Boltzmann  wrote  down  his  collision  term. 

(c)  One  should  expect  that  the  kinetic  equation  will  lead  to  the 
state  of  thermodynamic  equilibrium  and  that  then  Fx  and  all  the 
higher  distribution  functions  will  agree  with  the  results  obtained  from 
the  canonical  ensemble.  It  is  rather  easy  to  verify  that  if  one  takes 
for  Ft  the  Maxwell  distribution: 

FT'(x)  =  (InmkT)-*1*  e-’ll3mkT 


(42) 
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all  the  functionals  /4“,(jr|F*q)  vanish  and  that  then  the  functionals 
F,(x i-  ■  jr.l/T*1)  become  the  successive  terms  in  the  virial  expansion 
of  the  ^particle  equilibrium  distribution  functions.  The  simplest  is 
to  prove  this  successively  and  after  a  few  orders  it  will  then  be  clear 
how  it  goes  on. 

In  zeroth  order  with  (42)  Ai0>  is  obviously  zero  and  since  F{*  is 
spatially  uniform  one  obtains  from  Eq.  (29): 

Fl°>(x ,  -  ■  x,\Fn  =  (2nmkT) ~ 3"*  exp^  -  j 

where: 

Pi"  =  Lim  S,(-r)p, 

X—  CO 

are  the  momenta  of  the  s  particles  before  the  5-tuple  collision.  Since 
the  Hamiltonian  for  the  s  particles  can  be  written  as: 


"•  - S + 1  *»  -  s  i 


XI 


one  can  also  write : 


(43)  F,<0l(x,  ■  ■  ■  x,\ Fn  =  (2nmkT)  ~  3*'2  e  -  »■ lkT 

which  is  the  expected  form  for  the  5-particle  distribution  function  at 
very  low  density. 

In  first  order  one  obtains  from  (43)  and  (17b): 

/T’U.I/r)  =  (2nmkT) -3jdxa8ia  c*p(  - 

Introducing  the  relative  coordinate  r  =  r2  -  r,  the  space  part  of  the 
jc3-integral  will  be: 

which  is  clearly  zero.  Hence  /t<l,(.ti|F*q)  =  0  as  follows  of  course 
also  from  the  Boltzmann  form  for  An\  Since  Aa)  =  0,  Dil)Fl0)  =  0, 
and  since  from  (19b)  and  the  definition  of  in  (20),  restricting 
ourselves  to  the  case  s  =  2,  one  has: 


m*i,  *»\ F,)  =  -  D"'Fy  +  jdx3(Sl3  +  6>33)FJ0> 
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one  obtains  from  (30)  and  (43): 

(44)  m*,.  *.| Fn  =  (2irmkT)~9'2  j dr  Sa(  1,  2,  -r) 


(©l3+ 


From  {//3,  cxp(-H3/kT)}  =  0  follows: 


(«„  +  ©a3)*-,'»,',r  =  -  9ia)e-H^r  +  —  ■  A  e~H*lkT. 

\m  d r,  m  dt 2  J  m  dr3 

Introducing  this  in  (44)  the  last  term  clearly  integrates  to  zero.  Since: 

S2(  1 . 2,  —  t)(—  A  +  h±  -  el3\ 

\m  dr  t  m  dr2  J 

»  S2( 1 ,  2,  -r)3Ta 

=  *,Sa(l,2,  -r)  =  -  ^S2(  1,2,  -r) 


the  r-integral  in  (44)  gives : 

[1  -  S2( 1 ,  2,  -oo)]e'«3"tr. 

Writing  //3  =  Ht  +  p3/2m  +  <f>i3  +  <f>23  and  integrating  over  p3,  one 
gets: 

nl\xuxa\Fr) 

=  (2nmkT)~  3[\  -  S2(  1 ,  2,  -  oo)]eH> kr  ■  j dr3  e-<*.3 + 


Introducing  the  Mayer  function  /,  =  exp(  -  <f>„/kT)  -  1,  one  can 
write: 


J dr3  exp[-(<^13  +  <f>a3)/kT]  —  j  dr3(fi3/23  +  /i3  +  f33  +  1). 

The  last  three  terms  are  constants,  independent  of  (jc,,  x3),  so  that 
[I  -  5a(l,  2,  -  oo)J  acting  on  them  will  give  zero.  The  first  term  is  a 
function  of  |ra  -  r, |,  which  is  zero  if  |ra  -  r,|  >  2r0,  so  that: 

Sa(I,  2,  —  co)  j dt3  fi3  f23  =  0. 

One  obtains  therefore: 


(45) 


x2\Fn  =  (2nmkT)-3e  ^lkT  jdr3f13f2, 
=  F2°\xt,  x2\Fiq)  j dr3  fi3f23 


I 
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which  is  the  familiar  expression  as  derived  from  the  canonical  distribu¬ 
tion.*  By  the  same  argument  as  used  for  An)  it  follows  from  (45) 
and  (17c)  that  ^‘^(jrJfT1)  =  0,  and  so  on! 

(d)  We  have  shown  therefore  that  the  equilibrium  distributions 
are  solutions  of  the  Bogoliubov  equations.  Are  these  solutions  always 
reached  in  time?  Or  in  other  words,  can  one  generalize  Boltzmann’s 
//-theorem?  Unfortunately  there  is  here  a  gap  in  the  theory.  So  far 
we  have  been  unable  to  do  this.  I  feel  sure  that  it  must  be  possible 
but  one  is  hampered  by  the  lack  of  insight  in  the  structure  of  the 
successive  collision  kernels  so  that  it  is  difficult  to  see  how  one  should 
generalize  the  argument  of  Boltzmann. 

7.  Remarks  on  the  theory  of  the  hydrodynamical  stage.  It  would  take 
much  too  long  to  explain  in  detail  the  further  work  of  Dr.  S.  T.  Choh 
on  the  generalization  of  the  Chapman-Enskog  development  to  the 
kinetic  equation  of  Bogoliubov.  Let  me  therefore  only  point  out 
what  is  involved  and  indicate  how  far  the  theory  has  progressed. 

The  first  step  is  again  the  derivation  of  the  general  macroscopic 
equations.  Since  these  equations  are  exact  consequences  of  the 
BBGKY  hierarchy,  and  since  it  may  be  useful  to  have  them  somewhere 
together  1  will  still  write  them  out  in  detail,  and  to  facilitate  the  com¬ 
parison  with  the  analogous  development  in  §2  of  Chapter  VI  I  will 
use  in  parallel  with  Fu  Fa,  ■  the  distribution  functions  /,( r,  v,  t ), 
/„(r„  r2,  v,,  v2,  t)  in  the  velocities  instead  of  in  the  momenta.  The 
connection  between  them  is  given  by: 

/i(r,  v.  /)  dv  =  p  /'i(r,  p.  /)  dp, 

(46) 

NiN  -  I) 

/a(r i,  r2,  v„  v2,  l)  </v,  dv2  =  - p, —  f2(r„  r2,  p„  p2,  /)  dpx  dp2 


and  so  on. 

The  macroscopic  quantities,  which  describe  the  macroscopic  state 
of  the  gas,  are : 

(a)  the  mass  density  p  =  nm,  where: 

«( r,  /)  =  jdv  fiir,  v,  /); 

(b)  the  average  flow  velocity  u(r,  t),  defined  by: 

nu,(r,  t)  =  f«ft  i>i/i(r,  v,  /); 
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(c)  the  internal  energy  density  e(r,  /),  which  consists  of  two  parts: 
(47)  t(r,  /)  =  £kln(r,  /)  +  e^r,  t) 

where: 

(47a)  ekln  =  ^  m  jdv  l/a/,(r,  v,  t) 

with  U,  =  vt  -  u,  -  thermal  velocity  (note  that  ekln  was  called  Q  in 
Chapter  VI),  and  where: 


(47b) 


dr,<H\r  -  r,|)/2(r,  r„  v,  v„ /). 


In  terms  of  these  five  quantities  one  obtains  from  the  BBGKY 
hierarchy  the  macroscopic  equations  in  exactly  the  same  form  as  in 
Chapter  VI,  namely: 


(48)  ( 


with: 
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However  now  the  pressure  tensor  P„  and  the  heat  current  density  q{ 
consist  just  as  the  energy  density  e  out  of  two  parts,  a  kinetic  part 
and  a  part  which  depends  on  the  intermolecular  potential  <f>.  One 
finds: 


(49)  plt  =  ptr  +  />r 

where  Pkln  is  defined  just  as  in  Chapter  VI  by: 


(49a)  P\ Vn(r,  /)  =  m  Jrfv  Ut  UJAr,  v,  i) 

and  where: 


/V(r,  0  = 


r3,  t). 


(49b) 
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Here  /ia(r,,  ra,  t)  is  the  pair  density  distribution  defined  by: 

(50)  «a(r„  r2,  t)  =  j  jdvi  </va/a(r„  ra,  y„  va,  /) 

and  the  positions  r,,  ra  are  related  to  r,  and  the  relative  coordinate 
P  =  ra  -  ti  by: 

(51)  r,  =  r  +  (A  -  p)  r2  =  r  +  A  -■ 

p  P 

Finally  one  has : 

(52)  q{  =  qr  +  qFl 

where  again  <7!“"  is  defined  just  as  in  Chapter  VI  by: 

(52a)  qrn(r,  t)  =  fdv  VtV*fx( r,  v.  /) 

and  where : 

qt°l(r,  l)  =  \  \  dr  1  J  jdv  dvt  UU<M  |r  —  r,|  )/2(r,  r„  v,  v„  t) 

(52b) 

+  \  ^  TP  I  dX  S!1*1  dy^U'  a  +  WA*  *2'  v>’  Va’ ,)' 

In  the  second  line  the  positions  r,,  r2  are  again  expressed  in  r  and  p 
by  (51),  and 

U,  =  v,  -  o,  (/  =  1,  2). 

In  note  5  we  indicate  some  of  the  rather  straightforward  manipulations 
required  to  derive  (48)  from  the  BBGKY  equations.  It  is  also  not 
difficult  to  give  a  physical  interpretation  of  the  potential  energy  parts 
in  Ptl  and  q,,  but  we  will  leave  this  to  the  reader. 

One  now  has  to  use  the  expressions  the  Bogoliubov  theory  gives  for 
the  second  distribution  function  Fa  in  terms  of  Fu  and  then  just  as  in 
Eq.  (21)  of  Chapter  VI  one  must  develop  Ft  in  an  uniformity  para¬ 
meter  0,  postulating  at  the  same  time  hydrodynamical  equations  of 
the  form: 

=  ON'"  +  62N(2)  +  ..., 

Ct 

(53)  =  evt'u  +  o2v,'2)  +  --, 

=  8E'l)  +  e2E,2)  +  ••• 
ct 
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where  the  N(t>,  V(,)  and  £<()  are  functions  of  the  position  r  and  depend 
on  the  time  only  through  their  functional  dependence  on  the  macro¬ 
scopic  variables  n,  u  and  e. 

Unfortunately  the  formulas  become  rather  involved,  and  for  all 
details  I  have  to  refer  to  the  dissertation  of  Dr.  S.  T.  Choh.  In  con¬ 
clusion  let  me  therefore  only  state  what  Dr.  Choh  has  been  able  to 
show. 

1.  In  first  order  of  6  one  gets  the  ideal  or  Euler  hydrodynamical 
equations,  in  which  the  pressure  p  as  function  of  T  and  p  is  given  by 
the  usual  (equilibrium)  virial  expansion  and  where  the  internal  energy 
e  is  given  by  the  corresponding  thermal  equation  of  state. 

2.  In  second  order  of  6  one  gets  the  Navier-Stokes  equations,  or 
in  other  words  one  obtains  Fourier's  law  of  heat  conduction : 


dT 


and  Newton's  law  of  internal  friction: 

Ptl  =  p&n  —  2 P\^D„  —  ^  —  P2Daahit 

which  now  involves  two  viscosity  coefficients. 

3.  For  the  transport  coefficients  p2  and  *  one  gets  virial  expan¬ 
sions  of  the  expected  form  (2),  but  unfortunately  the  actual  values  of 
the  expansion  coefficients  depend  on  the  solution  of  inhomogeneous 
integral  equations  of  the  same  form  as  occur  in  the  Chapman-Enskog 
theory  but  much  more  complicated  since  the  inhomogeneous  part  now 
involves  the  three  particle  streaming  operator.  One  can  show  that 
these  equations  have  an  unique  solution,  but  to  find  the  solution  even 
for  special  intermolecular  potentials  (like  the  elastic  sphere  or  the 
Maxwell  model)  seems  very  complicated,  so  that  no  numerical  results 
have  been  obtained  so  far. 


Notes  on  Chapter  VII 


1.  The  work  of  N.  N.  Bogoliubov  was  published  in  his  book: 
Problemy  Dinamicheskoi  Teorii  v  Statisticheskoi  Fisike  (Moscow, 
1946).  An  excerpt  in  English  appeared  in  J.  Phys.  (U.S.S.R.)  10 
(1946)  26S,  and  a  complete  translation  (by  E.  K.  Gora)  appears  in 
the  first  volume  of  the  “Studies  in  Statistical  Mechanics”  (North 
Holland  Publishing  Co.,  Amsterdam,  1962).  A  short  account  of  the 
theory  was  presented  by  one  of  us  at  the  Boulder  Summer  Seminar  in 
I9S7  and  this  was  published  as  Appendix  1  in  the  book  by  M.  Kac 
( Probability  and  Related  Topics  in  the  Physical  Sciences,  Interscience, 
New  York,  1958).  Further  consequences  of  the  Bogoliubov  theory 
have  been  worked  out  in  the  University  of  Michigan  dissertations  of 
S.  T.  Choh  (The  Kinetic  Theory  of  Phenomena  in  Dense  Gases,  1958) 
and  of  R.  L.  Guernsey  (The  Kinetic  Theory  of  Fully  Ionized  Gases, 
1960).  In  the  text  we  follow  mainly  the  exposition  by  S.  T.  Choh. 

The  work  of  J.  G.  Kirkwood  has  appeared  in  a  series  of  papers  in 
J.  Chem.  Phys.  See  especially:  J.  Chem.  Phys.  14  (1946)  180  and  15 
(1947)  72.  Compare  also  the  account  given  by  J.  G.  Kirkwood  and 
J.  Ross  in  the  Proceedings  of  the  International  Symposium  on  Trans¬ 
port  Processes  in  Statistical  Mechanics  (Interscience,  New  York,  1958). 
In  these  proceedings  one  also  finds  a  paper  by  M.  S.  Green,  which  is  a 
summary  of  his  article  in  the  J.  Chem.  Phys.  25  (1956)  836.  More 
recent  work  of  the  same  author  appeared  in  Physica  24  (1958)  393  and 
in  Vol.  Ill  of  the  1960  Boulder  Lectures  in  Theoretical  Physics  (Inter¬ 
science,  New  York,  1961). 

The  work  of  Prigogine  and  co-workers  appeared  in  a  series  of 
papers  in  Physica  since  1956.  Some  of  the  recent  ones  are  by  I. 
Prigogine  and  R.  Balescu,  Physica  25  (1959)  281,  302,  324  ;  26  (1960) 
145,  529.  Compare  also  the  application  to  the  kinetic  theory  of 
plasmas  by  R.  Balescu  (Phys.  Fluids  3  (1960)  52). 

A  useful  comparison  of  the  different  methods  of  deriving  the 
Boltzmann  equation  has  been  made  by  E.  G.  D.  Cohen  (Physica  27 
(1961 )  163).  In  here  one  finds  also  further  references. 

Finally  one  should  point  out  that  the  first  attempt  of  a  kinetic  theory 
of  dense  gases  was  made  by  D.  Enskog  (Kungl.  Svenska  Vet.  Akad. 
Handl.  63,  No.  4  (1921);  see  also  Chapman  and  Cowling,  Mathematical 
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Theory  of  Non-uniform  Gases,  Chapter  16).  Enskog  always  used  the 
hard  sphere  model  and  he  started  from  the  Boltzmann  equation  in 
which  the  collision  term  was  modified  by  taking  into  account  the 
difference  of  position  of  the  two  colliding  molecules  at  the  moment  of 
impact  and  by  assuming  that  the  influence  of  the  other  molecules  can 
be  accounted  for  by  changing  the  collision  probability  by  a  factor 


where  b  is  the  van  der  Waals’  b,  which  insures  that  in  equilibrium  the 
equation  of  state  is  correct  up  to  and  including  the  third  virial  co¬ 
efficient.  To  this  modified  Boltzmann  equation  the  Chapman- 
Enskog  development  can  be  applied  and  this  leads  to  a  virial  expansion 
of  the  transport  coefficients  of  the  form  (2).  In  fact  one  finds: 


(b) 


i i 
1*0 
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*0 


.  7  b 

'  +  3o;  + 

.  +  gi  +  . 

40  v 


The  Enskog  theory  has  been  discussed  by  many  authors;  see  for 
instance:  H.  B.  Hollinger  and  C.  F.  Curtiss,  J.  Chem.  Phys.  33  (I960) 
1386  and  J.  V.  Sengers  and  E.  G.  D.  Cohen,  Physica  27  (1961)  230, 
where  one  finds  also  further  references.  There  is  no  doubt  that  the 
Enskog  theory  explains  at  least  qualitatively  what  happens  at  higher 
densities.  However,  because  of  the  lack  of  a  systematic  procedure 
and  especially  because  of  the  uncertain  validity  of  the  Ansatz  (a),  it  is 
at  present  not  known  whether  the  numerical  coefficients  in  the  virial 
expansions  (b)  are  correct. 

2.  For  the  derivations  by  Bogoliubov  and  by  Kirkwood  see  the 
references  in  Note  I .  The  work  of  M.  Born  and  H.  S.  Green  appeared 
in  a  series  of  papers  in  the  Proceedings  of  the  Royal  Society.  They  are 
collected  in  a  book  called  A  General  Kinetic  Theory  of  Fluids  (Cam¬ 
bridge  University  Press,  Cambridge,  1949).  For  the  work  of  J.  Yvon 
see  his  monograph  La  Theorie  Statistiques  des  Fluids  in  the  Actuality 
Scientifiques  et  Industrielles  (Hermann,  Paris,  no.  203,  1935). 

3.  The  fact  that  the  application  of  a  boundary  condition  like  (26) 
but  for  r  ->  -  oo  leads  to  the  wrong  sign  in  the  collision  term  of  the 
Boltzmann  equation,  was  pointed  out  by  E.  G.  D.  Cohen  and  T.  H. 
Berlin  (Physica  26  (I960)  717).  In  this  paper  and  also  in  the  article 
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by  E.  G.  D.  Cohen  (Physica  27  (1961)  163)  it  was  pointed  out  that  in 
all  derivations  of  the  Boltzmann  equation  from  the  Liouville  equation 
a  similar  choice  of  the  appropriate  solution  is  made. 

Clearly  a  further  clarification  of  the  role  of  the  boundary  condition 
(26)  can  come  about  only  when  one  sees  in  which  way  the  solution  of 
the  initial  value  problem  for  the  Liouville  equation  reaches  the  form 
(14)  postulated  by  Bogoliubov.  We  hope  and  we  expect  that  this 
insight  will  come  from  the  work  of  M.  S.  Green. 

4.  See  J.  E.  Mayer  and  E.  Montroll,  J.  Chem.  Phys.  9  (1941)  2, 
where  one  finds  the  general  expressions  for  the  Fi,)(xl  ■  •  •x,|F1‘K'). 
For  a  discussion  of  the  virial  expansion  of  the  5-particle  distribution 
function  with  the  help  of  the  theory  of  linear  graphs  as  in  Chapter  II 
see  the  review  article  of  the  authors  quoted  in  Note  6  of  Chapter  II. 

5.  We  will  follow  essentially  the  work  of  J.  H.  Irving  and  J.  G. 
Kirkwood,  J.  Chem.  Phys.  18  (1950)  817.  The  method  is  completely 
analogous  to  the  method  used  in  Chapter  VI  to  derive  the  macro¬ 
scopic  equations  from  the  Boltzmann  equation. 

Start  from  the  first  equation  (11a)  of  the  BBGKY  hierarchy. 
Integrating  over  pi  leads  immediately  to  the  continuity  equation  (48a) 
since  the  right-hand  side  vanishes.  Multiplying  (11a)  by  pjm  and 
integrating  over  pi,  one  obtains  after  some  simple  transformations: 


(c) 


Du, 
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MW 

8xa 


-  J*,  J</v  J dvt  —Mr,  r„  V,  v„  i) 


where  we  have  changed  to  velocity  variables  and  the  distribution  f2 
according  to  (46).  The  question  remains  to  show  that  the  last  term 
in  (c)  can  be  written  as  the  divergence  of  the  second  part  Pff1  of  the 
pressure  tensor.  The  simplest  way  is  to  verify.  From  (49b)  one  gets: 


MW 

'OXa 


2  jdp  Pp*  dP  1  dX  8xa  Wa(ri’  r*’  °- 


Since  nt  is  a  function  of  r  +  Ap Ip  and  p  (see  Eq.  (51)  defining  r,  and 
r2),  it  is  clear  that: 

8na  =  pa  8n2 
d\  p  dxa 

Hence: 

~  \  jdp  p^{"a(r’r  +  p’°  "  ”a(r  “ 
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Changing  in  the  second  term  the  direction  of  p,  and  using  the  sym¬ 
metry  of  in  the  position  variables,  one  gets: 


?PfSl 

tixa 


dp 


«a(r,  r  +  p,  t) 


and  this  is  identical  with  the  last  term  in  (c)  if  there  one  puts  rt  = 

r  +  p. 

To  derive  the  energy  equation  (48c)  one  multiplies  (I  la)  by  pV2m 
and  integrates  over  p(.  Using  (48b)  one  then  obtains: 
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To  obtain  the  transport  equation  of  the  potential  energy  density  we 
have  to  use  the  second  equation  (I  lb)  of  the  BBGKY  hierarchy.  By 
multiplying  (lib)  with  #|r,  -  ra|)/2  and  by  integrating  over  r2,  Pi 
and  p2  one  obtains  after  some  rearrangements  and  renamings  and  after 
changing  to  velocity  variables: 

p  Dt  \  p)  <  xa 
(e) 

=  i  S / Sdy  </v'(r“  “  r,  o)  ^  ^  ri*Va(r>  r»,  V,  v„  i) 

where  qf°uu  is  the  first  part  of  the  potential  energy  part  of  the  heat 
current  density  given  by  the  first  line  in  Eq.  (52b).  Adding  equations 
(d)  and  (e),  and  using  the  expression  (49b)  for  Pf one  gets: 


D  If 
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Finally  it  is  easy  to  verify  that  the  last  term  is  the  divergence  of  the 
second  part  of  given  by  the  second  line  in  Eq.  (52b),  which  con¬ 
cludes  the  proof  of  the  energy  equation. 


Appendix 


Quantum  Statistics  of  Interacting 
Particles 


By  Elliott  W.  Montroll 

1.  The  partition  function.  As  was  pointed  out  in  the  lectures  of 
Professor  Uhlenbeck,  the  calculation  of  thermodynamic  properties  of 
an  assembly  of  particles  can  be  accomplished  by  performing  appro¬ 
priate  elementary  operations  (taking  logarithms,  differentiation,  etc.) 
on  the  partition  function.  In  the  case  of  an  assembly  whose  dynamics 
is  described  by  the  laws  of  quantum  mechanics,  this  function  is 

(1)  Z  =  ]>  exp  - /3£, 

i 

where  the  set  {E,}  is  the  set  of  all  energy  levels  of  the  assembly;  i.e. 
the  set  of  characteristic  values  of  the  Schrddinger  equation 

(2)  //*,  =  E,K 

the  H  being  the  Hamiltonian  operator  of  the  assembly  and  the 
the  wave  functions  or  characteristic  functions  of  H.  The  parameter 
/9  is  1  IkT,  T  being  the  temperature  of  the  assembly  and  k  Boltzmann's 
constant. 

Clearly,  if  one  can  solve  the  Schrddinger  equation  for  the  assembly 
of  interest,  the  calculation  of  Z  is  trivial.  Generally,  ^  is  a  function 
of  3 N  (with  N  -  0  ( I023)  being  the  number  of  particles  in  our  assembly) 
variables,  say  the  position  vectors  of  the  particles  in  the  assembly,  and 
H  is  an  operator  which  depends  on  the  same  variables.  If  one 
postulates  additivity  of  intermolecular  forces,  then 


(3a) 

/,2  N  NS-  1 
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and  W)  is  the  potential  energy  of  interaction  between  the  /th  and 
yth  particles  when  they  are  separated  by  a  distance  ru.  The  only 
interesting  case  of  interacting  particles  for  which  the  Schrddinger 
equation  has  been  solved  exactly  is  that  of  coupled  harmonic  oscillators 
(crystals  undergoing  small  vibrations).  For  all  other  significant 
examples,  approximations  must  be  made.  My  lectures  are  concerned 
with  the  systematic  application  of  perturbation  theory  to  the  problem 
of  the  calculation  of  the  partition  function.  Although  a  considerable 
part  of  the  discussion  will  be  quite  general,  the  example  which  I  hope 
to  analyze  in  some  detail  is  that  of  the  gas  of  ionized  particles.  In 
that  case  <f>  represents  the  coulomb  interaction 

(4)  W)  =  Sf, 

e,  being  the  charge  on  the  yth  particle. 

The  methods  presented  in  the  lectures  are  those  developed  by  J.  C. 
Ward  and  myself.  They  are  closely  connected  with  those  developed 
independently  by  Matsubara;  Huang,  Lee,  and  Yang;  Bloch  and 
DiDominicis;  Gell-Mann  and  Brueckner;  Goldstone,  and  other 
authors.  The  characteristic  feature  of  all  this  work  is  that  perturba¬ 
tion  theory  is  developed  in  a  systematic  way  and  that  diagrams  are 
used  extensively.  These  techniques  were  inspired  by  the  celebrated 
paper  of  Feynman  on  the  theory  of  electrons  and  positrons.  As  the 
development  proceeds,  I  suspect  that  occasionally  you  mathematicians 
will  be  amused  by  the  methods,  but  probably  the  recklessness  of  the 
approach  will  more  often  give  you  a  feeling  of  uneasiness.  A  proper 
mathematical  investigation  of  the  expansion  of  the  partition  function 
into  a  series  of  divergent  cluster  integrals  and  their  recollection  into 
convergent  expressions  would  be  most  welcome. 

Actually,  Martin  and  Schwinger,  Fradkin,  Siegert  and  others  have 
discussed  the  partition  function  by  nonperturbative  techniques  but 
these  lead  to  the  necessity  of  making  other  types  of  approximation. 
Their  approach  will  not  be  discussed  here  but  it  might  be  of  interest 
to  some  of  you. 

The  partition  function  (I)  can  be  written  in  an  alternative  form  in 
terms  of  the  characteristic  functions  {>!>,}  of  (2).  Each  */<  depends  on 
the  schematic  variable 

(5)  r  *  {r>,r3,  •••,!•"}. 

Then 

(6)  Z  =  J  ■  "  J*2  d3Nr  ~  trace  C*P  “  PH- 
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Note  that  the  operator  exp  -pH  plays  a  decisive  role  in  equilibrium 
statistical  mechanics  while  exp  itH/H  is  the  operator  from  which 
dynamics  can  be  developed.  That  is,  the  real  part  of  the  complex 
variable  P  +  ith  ~ 1  is  associated  with  thermodynamics  and  the  imagin¬ 
ary  part  with  dynamics  while  both  the  real  and  imaginary  parts  find 
their  place  in  the  statistical  mechanics  of  nonequilibrium  processes. 
We  start  our  development  with  an  analyses  of  Boltzmann  statistics. 
In  the  case  of  fermions  or  bosons  the  summation  over  j  in  (6)  and  (7) 
is  taken  over  only  antisymmetric  or  symmetric  0's. 

We  define  what  is  technically  known  as  a  “propagator”  by 

K(2,  I)  =  K(r2p2-  r,ft)  =  ( £  e-‘*» 0?(/-i)0Ar2)  if  ft  £  ft 
(7)  |o  if  ft  <  ft 

=  *ft  -  ftl^e-^-V*,^)^) 


0(x)  being  the  Heaviside  step  function 


(8) 

* 

II 

(9) 

d6jdx  m  800. 

The  partition  function  is  then 

(10)  Z  =  J  y  J  K(rp,rO)d3Nr. 

Although  the  case  j ft  <  ft  does  not  appear  directly  in  this  formula, 
the  setting  of  K  =  0  under  this  condition  will  be  convenient  in  the 
perturbation  theory  of  the  calculation  of  Z  since  certain  restricted 
integrations  over  P's  will  be  replaced  by  unrestricted  ones.  Actually, 
an  extra  factor  I  INI  should  be  introduced  into  (10)  because  in  inte¬ 
grating  freely  over  the  r  s,  many  sets  of  possible  positions  differ  from 
each  other  only  through  a  permutation  of  particles  so  that  they  are 
overcounted  by  a  factor  N\.  In  §§  1  and  2  lower  indices  on  r's 
identify  the  value  of  p  with  which  the  r  is  associated;  i.e.  r2  is  the 
r  value  (Eq.  5)  at  ft.  The  upper  indices  denote  particle  numbers. 
Since 

(11)  =  Hra  ~  ri)< 

t 

(12)  H(2)<l>,(ra)  =  £,0,(r2),  and  c>9(x)lc)x  =  &(x) 
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where  H(2)  is  the  Hamiltonian  operator  in  terms  of  the  variables 

(13)  ra  =  {rj, 
we  have 

(14)  +  H( 2))«2,  1)  =  HP2  ~  Pi)  S(ra  -  rx). 

This  equation  is  just  the  Green's  function  form  of  the  Schrodinger 
equation  with  p  replaced  by  itjh.  The  -homogeneous  form  of  this 
equation  is  often  called  the  Bloch  equation.  The  propagator  is  a 
Green’s  function. 

In  the  case  of  free  particles,  the  Hamiltonian  is 


(15)  H0(2)  = 

and  (14)  is  just  the  point  source  diffusion  equation  for  N  independent 
particles  with  a  diffusion  constant  D  =  63/2m.  In  all  cases 

f>(r2  ~  r,)  =  I  ]  6(r£  -  rj)- 

i  =  i 

The  free  particle  propagator,  which  is  the  solution  of 

(16)  +  //0(2)|k0(2,  I)  =  &(P2  —  Pi)  %r2  -  /•>), 

is  then 

(17)  K0(2,  1)  =  0(p2  -  & )  PI  *<5(2,  1) 

i-i 

where  *o(2,  1)  has  two  equivalent  forms  in  the  case  of  an  unbounded 
space 

(18)  *<5(2,  1)  =  J  exp  {ip'  •  (r j  -  r/)*-‘ 

-  ^(P2  ~  PiW)2)  dV 

nQv  =  exp  -  im{r<2  -  r{)2l2h2(p2  -  ft,)} 

'  ;  ■  [2rj7i3(fta  —  Pi)lm]312 


The  form  for  Ko  in  a  finite  cubic  box  of  volume  L3  =  V  with  periodic 
boundary  conditions  is 


x  exp  [i(pi-rl  -  p{  r{)lh  -  (fta  -  ftiXp()3/2m] 
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where  the  summation  proceeds  over  all  momentum  states 

(21)  p  =—•  n  =  0.  ±1,  ±2,  -- 

and  ft,,  P2  is  a  Kronecker  delta.  As  L  ->  oo  we  write 

(22)  d3p  =  ( InhILf 

so  that  the  summation  becomes  a  double  integral.  The  fact  that 

(23)  KP*  ~  Pi)  =  lim 

v-»  ®  (2wn)J 

implies  that  (20)  reduces  to  (18)  in  the  required  limit. 

2.  Perturbation  theory.  With  a  knowledge  of  K0,  we  are  prepared 
to  give  a  systematic  perturbation  theory  of  K.  Let 

H  =  H0  +  H, 

where  H0  is  given  by  (15).  Then  the  differential  equation  (14)  is 
equivalent  to  the  integral  equation 

(24)  K{ 2.  1)  =  *,,(2.  I)  -  f  f  *  *0(2,  3)//,(3)K(3.  \)dwr3dp3. 

JV  J  -  *> 

This  equivalence  can  be  verified  by  applying  the  operator  (r’/r ft  +  //( 2)) 
to  obtain  (in  view  of  (16)  and  (24)) 

(j£+  //(2))*(2.  1)  -  //,(2)*0(2.  I)  +  S(ft  -  /?,)  S(r3  -  r,) 

-  Ht{2)  j  J*  *0(2,  3)///(3)AT(2,  IM3*r3 

~  jy  j  _  ft)  2  —  rs) 

x  //,(3)*(3,  \)d3Nr3dp3 
=  H,(2)K0(2,  I)  +  S(ft  -  ft)  S(r3  -  r.) 

-  H,(2)K{2,  I)  +  H,(2)[K(2,  1)  -  AT0(2,  1)] 
=  5(ft  —  Pi)  8(r 2  —  rt) 

as  is  required.  Since  both  *0(2,  1)  and  *(2,  I)  were  defined  to  vanish 
when  ft  <  ft,  the  limits  on  (24)  could  just  as  well  be  written  as  going 
from  ft  to  pa  so  that  (24)  is  equivalent  to 

(25)  *(2,  1)  =  K0(2,  1)  -  f  f3  AT0(2,  3)//,(3)AT(3,  1  )d3Nr3dp3. 

Jv  Jti 
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This  equation  can  be  integrated  to  yield 

(26)  K(  2,  1)  =  tf0(2,  1)  -  f  fa  K0(2,  3)//,(3)/f0(3,  1)  d3Nrs  dp3  + 

Jv  Jt I, 

f  P  f  f3  Xo(2,  3)//,(3)*o(3, 4)H,(4)K0(4,  1 )  d3*r4  dp,  d™r3  dp3 - . 

JV  Jti  Jv  Jh 

If  H,  is  proportional  to  a  coupling  constant  A,  this  expansion  is  a 
power  series  in  A  and  therefore  a  perturbation  series.  If  we  use  the 
explicit  formula  for  H,  given  in  (3a) 

(27)  //,(3)  =  I  ZW'5) 

k>)  l-l 

and  at  first  restrict  ourselves  to  the  case  N  =  3 

H,( 3)  =  rjj)  +  ftrl  r%)  +  rjj) 
then  (26)  becomes 

(28)  K{ 2.  1)  =  A:0(2.  1)  -  f  fa  K0( 2.  3)Mr33.  r>)  +  r§) 

Jv 

+  <Krt  ri)]K0(3,  1 )  d3Nr3  dp3  +  (9  second  order  terms)-  •  • . 

Each  term  in  this  expansion  can  be  identified  by  a  Feynman  type 
diagram  in  P,  r  space.  The  factor  in  K0( 2,1)  which  represents 
particle  “1"  being  propagated  as  a  free  particle  from  /3,  to  P2  (see 
Eq.  (19)  for  analytical  form) 


Figure  1 

is  represented  by  a  line  from  level  pt  to  level  p2,  while  the  interaction 
<f>(r'3,  rjj)  at  p3  is  represented  by  the  horizontal  line  in  Fig.  1.  The 
zero  order  term  has  the  diagram  given  in  Fig.  2  while  three  parts 


PERTURBATION  THEORY 


149 


contribute  to  the  first  order  term. 


These  are  given  in  Fig.  3. 


I 


r 


Figure  2 


fif 

#3- 

- 


1 


(a)  (b)  (c) 

Figure  3 


Of  the  nine  terms  of  second  order,  three  are  drawn  in  Fig.  4.  These 
are  obtained  by  inserting  one  more  interaction  line  at  04  with  (/S4  <  /53). 


Figure  4 

Three  more  terms  can  be  obtained  from  (b)  in  this  way  as  can  three 
from  (c).  Higher  order  terms  are  obtained  in  a  similar  manner.  It 
is  to  be  emphasized  that  all  interaction  lines  are  “isothermal,”  both 
ends  existing  at  the  same  temperature  (0  values). 
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In  the  partition  function  we  must  set  0,  =  0,  0a  =  0,  rx  -  ra  =  r 
and  integrate  over  r.  Then 

(29)  Z  =  i  K0(rp,  rO)  d°r  -  £  £  d°r  d°r3 

*  J*  dfiaK0(rf},  r3,  0a)  ^  <Hra  ~  ra)^o(r aPa  <  rO)  +  •  ■  •  J- 

All  diagrams  of  the  partition  function  extend  over  the  0  range  (0,  0) 
and  all  diagrams  start  and  end  at  the  same  r  values.  All  points  on 
the  strip  in  (0,  r)  space  which  correspond  to  the  range  (0,  0)  can  be 
mapped  on  a  cylinder  of  circumference  0,  and  in  view  of  the  identity 
of  the  r  values  at  each  end  of  the  range  all  our  particle  lines  form 
closed  curves  on  the  cylinder.  When  periodic  boundary  conditions 
are  applied,  the  cylinder  becomes  a  torus.  We  represent  our  diagrams 
by  the  form  they  have  on  the  cylinder.  The  free  particles  of  Fig.  5 
have  the  representation 

O  O  O  ;  free  particles. 


0 


Figure  5 

The  terms  of  first  and  second  order  in  (29)  correspond  to  the  diagrams 

OjO  O  +  QgD  +  O  QjO  :  one  interaction 

CCD  O  +  CCdT)  +  CUTO  +  6  similar  terms:  two  interactions 

Note  that  in  a  term  of  (29)  in  which  two  particles  are  connected  by 
interaction  lines  and  one  remains  free,  the  appropriate  integral  factors 
into  two  parts,  one  a  triple  integral  over  the  free  particle  position 
coordinates  and  the  other  a  six-fold  integral  over  the  position  co¬ 
ordinates  of  the  interacting  pair. 

Generally,  in  the  case  of  an  assembly  of  N  interacting  particles, 

(30)  ZN  =  1  K0(rf}:rQ)d3Nr  -  jj  d™rd™r3 

x  J  K0(rf3:r 303)  ^  <)>(r3  —  r3)K0(r 303 :  rO)  dfi3  +  •  •  • 
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A  typical  integral  in  the  expansion  might  correspond  to  a  diagram 
with  St  free  particles,  sa  pairs  of  particles  connected  by  interaction 
lines,  sa  triples  connected  by  interaction  lines,  etc.  Then  that  integral 
factors  into  s,  factors  of  integrals  of  individual  particle  positions; 
$a  factors  of  integrals  over  positions  of  pairs  of  particles,  etc.  Of 
course,  any  integral  associated  with  a  diagram  which  corresponds  to  / 
connected  particles  appears  as  a  factor  in  many  terms  in  the  expansion 
of  the  partition  function  Z. 


*3 

*2 

01 

W 


COO-O  O— CCD 


(a) 


(b) 

Figure  6 


(C) 


Now  let  us  consider  the  sum  b,  of  all  integrals  which  correspond  to 
diagrams  of  /  particles  connected  by  any  number  of  interaction  lines 
«>/-!: 

(31)  b,  =  ^  (integral  associated  with  a  diagram  of  /connected  particles) 

x  (number  of  ways  diagram  can  be  constructed  from  /  particles). 

A  remark  or  two  is  appropriate  about  the  integrals.  These  were  first 
defined  so  that  the  interaction  lines  occur  in  an  appropriate  sequence 

(32)  0  <  0,  <  0a  <  •  •  -0*  <  0. 

In  a  very  symmetric  diagram  such  as  that  given  in  Fig.  6,  one  might 
wish  to  integrate  freely  over  all  0's  instead  of  adhering  to  the  con¬ 
ditions  (32).  Then  he  would  have  to  divide  by  A  !  because  an  inter¬ 
change  of  the  0*'s  does  not  change  the  character  of  the  diagram. 
These  interchanges  can  be  made  in  A !  ways.  Also  consider  the  two 
diagrams  6b  and  6c.  These  would  be  recorded  in  b3  as  a  single  diagram 
because  in  the  r  and  0  integrations  of  Fig.  6b,  one  automatically 
arrives  at  a  diagram  which  is  topologically  also  of  the  form  of  Fig.  6c. 
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The  quantity  b,  contains  all  possible  connected  diagrams  with  / 
points,  each  diagram  being  given  its  appropriate  weight.  Some 
examples  are 

bi  =  0, 

(33)  fc2  =  ioo  +  ^  0=0  +  -^  CBD+  -, 

b3  =  \  OOO  +1  ■ 

There  is  a  one-to-one  correspondence  between  every  term  in  (30) 
and  every  term  in 

(34)  Z„  =  2  n 

Sl*l  -  N 

The  factor  A!/j,!.y2!-  •  ■  is  the  number  of  ways  N  particles  can  be 
divided  into  subsets  such  that  subsets  contain  one  particle,  s2  two 
particles,  etc.  The  summation  in  (34)  extends  over  all  partitions  of 
N  such  that 

(34a)  2  ~  bf- 

The  grand  partition  function  of  a  grand  canonical  ensemble  of 
systems  composed  of  any  number  of  particles  of  our  type  of  interest 

(35)  Zc  =  2  z"ZN. 

N  -  1 

This  equation,  when  combined  with  (34)  yields 

(36)  Z0  =  2  2  n  (z'bi )’'//!  =  exP  2  b>z' 

N  =  1  E/It  -  N 

so  that 

(37)  logZc  =  2  biZ1- 

The  logarithm  of  the  grand  partition  function  is  then  the  sum  over  all 
connected  diagrams  containing  any  number  of  particles;  a  factor  zl 
weights  any  diagram  composed  of  /  particles. 

Let  us  now  investigate  the  influence  of  quantum  statistics  on  the 
calculation  of  the  b,' s.  Quantum  statistics  is  a  consequence  of  the 
indistinguishability  of  particles  of  the  same  species.  Two  diagrams 
exist  which  correspond  to  a  pair  of  particles  starting  at  (r\  r2)  and 
ending  at  the  same  points.  Both  are  exhibited  in  Fig.  7.  An  observer 
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who  makes  observations  at  P  -  0  and  p  cannot  distinguish  between 
the  two  cases;  both  must  be  included  in  the  evaluation  of  the  partition 
function. 

When  Fig.  7b  is  mapped  on  a  cylinder  in  (r,  p)  space  particle  1  ends 
where  2  starts  (Fig.  7d)  and  vice  versa  so  that  (7b)  is  equivalent  to 
a  single  particle  which  circulates  around  our  (r,  P)  cylinder  twice, 
or  in  the  case  of  periodic  boundary  conditions  around  our  ( r ,  P)  torus 
twice.  This  discussion  of  indistinguishability  can  be  generalized 
immediately  to  sets  of  any  number  of  particles.  A  set  of  n  particles 
such  that  I  ends  where  2  started,  2  ends  where  3  started,  ■  ■  -  ,n  ends 
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(a) 


(b) 

Figure  7 


(c) 


Figure  7d  Figure  7e 

where  1  started  (see  Fig.  6c)  maps  on  the  (r,  P)  cylinder  or  torus  as 
one  particle  which  circulates  around  n  times,  the  total  length  of  p 
.  space .  spanned  being  np.  We  call  such  a  construct  an  «-toron. 
Should  our  ^articles  be  fermii 's,  a  factor  ( - 1 )  must  be  included  for 
each  particle  interchange  between  P  =  0  and  p  =  p.  This  is  a  conse¬ 
quence  of  the  definition  of  the  propagator  K  (see  Eq.  7).  In  the  case 
of  fermions  the  summation  over  wave  functions  </>,  in  (7)  is  restricted 
to  extend  only  over  the  antisymmetric  wave  functions  of  the  operator 
H.  Hence  the  interchange  in  the  position  of  two  particle  coordinates 
in  one  of  the  s  results  in  a  change  in  sign.  If  at  p  =  pt  in  Eq.  (7) 
the  yth  wave  function  is  /■?,•••)  and  if  at  p  =  pa  two  ra  s  are 
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exchanged,  the  resulting  ^(r3)  changes  sign  as  does  K( 2,  1).  Indeed 
when  (n  -  I)  exchanges  in  sign  occur  as  in  Fig.  6c  (an  w-toron)  one 
must  include  a  factor  (-l)"1  =(-l)*  +  l  with  the  free  particle 
propagator.  No  ( - 1)  factors  appear  in  assemblies  of  bosons  because 
in  that  case  the  summation  in  (7)  extends  only  over  symmetrical  wave 
functions  </</. 

In  the  case  of  N  free  fermions  a  typical  set  of  interchanges  of  initial 
coordinates  r,  at  /?,  to  final  coordinates  r2  at  /32  can  be  represented  by 
a  diagram  similar  to  that  in  Fig.  8.  The  set  of  all  possible  inter- 


Figure  8 

changes  (with  appropriate  signs)  can  be  generated  by  the  Mh  order 
determinant 

(38)  K'*'(r2l S2 ;  rtf, )  =  jp  del*  { K0(r<2(i2 ;  rtf, )}  =  jp  det  (2,  1 ) 

where  the  element  in  the  yth  row  and  kth  column  is  just  the  free 
particle  propagator  (19)  which  takes  a  particle  from  ( r\ ,  /?,)  to  ( r'3 ,  jS3) 
(i.e.  the  function  obtained  by  replacing  r\  in  (19)  by  r\.  This  deter¬ 
minant  has  the  required  antisymmetry  with  respect  to  the  exchange 
of  two  initial  coordinates  or  two  final  coordinates  since  such  an 
exchange  is  equivalent  to  exchanging  a  pair  of  rows  or  columns  in 
the  determinant.  The  symmetrical  boson  propagator  is  (W!)'1 
multiplied  by  the  permanent  of  Ko{.r’2f}2-,  rtf,). 

The  propagator  of  a  set  of  N  interacting  fermions  is  obtained  from 
(26)  by  replacing  K0( 2,  I)  by  (l/N!)  det  (2,  1)  and  all  the  K0( 2,  3)  by 
( 1  IN'.)  det  (2,  3).  The  resulting  fermion  propagator 

(39)  K’i 2, 1)  =  jp  [det  (2.  I)  -  £  j“'  det  (2,  3)//;(3)K0(3,  1  )dr3  dpa 

+  j**  det  (2,  3)//,(3)AT0(3,  4)W,(4)A(0(4,  1)  dr,  df}<  dr3  t//S3 - ]- 

The  antisymmetry  of  KF( 2,  1 )  with  respect  to  interchange  of  the  final 
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r' s  follows  from  the  fact  that  the  final  coordinates  only  appear  in  the 
determinant  factors.  The  antisymmetry  with  respect  to  exchange  of 
initial  coordinates  is  clear  from  the  following  argument  when  H,( 3) 
is  symmetrical  with  respect  to  interchange  of  pairs  of  initial  particle 
coordinates.  Exchange  two  particle  coordinates  in  ru  say  r\,  and  r\ 
in  the  second  term  in  (39).  The  term  K0( 3,  1)  has  the  same  form  as 
it  would  without  the  interchange  if  we  replace  r\  by  r§  and  /•§  by  r\. 
This  does  not  change  the  volume  element  for  integration  or  the  inter¬ 
action  part  of  the  Hamiltonian  H,(3)  (provided  that  H,( 3)  is  sym¬ 
metrical  in  all  particle  variables).  It,  however,  does  interchange  a 
pair  of  columns  in  the  determinant  so  that  the  determinant  can  only 
be  returned  to  its  original  form  at  the  expense  of  changing  the  sign 
of  the  second  term  in  (39).  The  same  argument  can  be  applied  to 
each  term  in  the  perturbation  expansion. 

The  fermion  partition  function  is  then 

(40)  =  —  [  Jdet(I,  \)drx 

-  J  J  det  (1,  3)H,(3)K0(3,  1 )  dr3  dp3  dr,  +  •  •  •]• 

As  in  the  Boltzmann  case  the  partition  function  can  be  expanded  as  a 
sum  of  products  of  integrals  over  connected  clusters  or  subsets  of 
particles.  However,  now  clusters  can  exist  even  in  the  absence  of 
interactions;  the  w-toron  is  a  cluster  of  n  particles  which  are  connected 
by  virtue  of  their  indistinguishability.  In  the  case  of  bosons  the 
determinants  in  (40)  are  to  be  replaced  by  permanents. 

If  one  introduces  an  extended  b,  which  is  the  dll'.)  multiplied  by 
the  sum  over  all  integrals  which  correspond  to  diagrams  of  /  particles 
connected  by  any  number  of  interaction  lines  or  by  indistinguish¬ 
ability  or  both,  one  can  rederive  (34),  or  with  the  introduction  of  the 
grand  partition  function  (37).  The  first  few  b,'s  are  now 

by  =  0. 

(41)  h2  =  ^  GD  +  2  o-o  +  GD  ■**  4  0=0  +  ■  •  • , 

*>3  =  3  (Afl)  +  \  °00  +  +  ^ 

Here  ffi  represents  a  2-toron,  <S>  a  3-toron,  etc. 

The  next  section  contains  a  derivation  of  the  set  of  rules  needed  for 
the  calculation  of  each  component  of  a  b,. 


i 
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3.  Algorithm  for  construction  of  integrate  associated  with  diagrams. 

The  form  we  use  for  the  free  particle  propagator  is  (18) 

(42)  tfo(r203:  Fi/J i)  =  J* exp  [/(r3  -  rj  pfr1  -  -  &)/>*]  </3/» 

and  the  form  we  use  for  the  potential  energy  of  interaction  is  its 
Fourier  integral  representation 

(43)  <Wr2  -  r,)  =  J  «(</)exp  [iq  (r2  -  </fy 

Although  considerable  generality  exists  in  our  development,  the 
example  which  we  frequently  have  in  mind  in  our  discussions  is  the 
coulomb  interaction 

(44)  =  ealr 

( e  being  the  charge  on  any  one  of  our  particles).  It  is  easily  verified 
that  the  appropriate  form  for  u(q)  is 

(45)  u(q)  =  e2j2frn2q2 
since 

(46)  ;  =  |  2^Y  exp  {iq  rh~l)  d'q' 

In  this  example  our  perturbation  theory  is  an  expansion  in  powers  of 

e2. 

In  the  absence  of  any  interaction  the  only  contribution  to  bt  is  an 
/- toron  or  a  free  particle  which  encircles  our  (r,  P)  cylinder  /  times, 
thus  spanning  a  range  //3  of  p  space  in  its  course  which  starts  and  ends 
at  the  same  point  r.  The  appropriate  integrand  for  b,  is  (since 
Pa  —  Pi  =  IP  in  (40)  while  r2  =  rx  =  r) 

(47a)  K0(r,  IP;  r,  0)  =  J  exp  (~iPp2)l2m  d3p, 

which  is  independent  of  r.  We  must  finally  integrate  this  over  all 
possible  starting  positions  r.  This  integration  merely  contributes  the 
volume  of  the  container  since  (47)  is  independent  of  r.  In  the  case  of 
fermions  a  factor  ( -  1)' 4 1  must  be  added.  Hence 

*■  -  n  /  «p  J,p} 

x  {number  of  ways  an  /-toron  can  be  constructed  from  /  particles}. 
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The  ordering  of  our  /  particle  lines,  each  of  length  /3,  around  our 
( r ,  /9)  cylinder  is  equivalent  to  that  of  /  points  connected  in  a  ring. 
The  number  of  different  ways  this  can  be  done  is  (/  -  1)!  Hence 

(47b)  b,  =  (-±-y —  J  exp  ( -  If}p*l2m)  d3p, 

the  upper  sign  being  appropriate  for  fermions  and  the  lower  for 
bosons. 

The  grand  partition  function  and  hence  ( PV/kT )  of  a  set  of  free 
fermions  or  bosons  is  obtained  by  substituting  (47b)  into  (37).  One 
obtains  the  well-known  result 

(48)  PV/kT  =  log  Za  =  +  ^  /- 1  J  ( +  2  exp  -&>2/2m)'  d3p 

=  1  (2^)3  J  log(1  1  -  e-ei,2l2n)d3p 
The  parameter  z  must  be  chosen  so  that 

K  )  p  V  V  tz 

I  f  z  exp  (-jfy>a/2m)  d3p 
~  (2 nfi)3  J  1  +  z  exp  (-fipal2m) 

is  the  preassigned  density  of  our  assembly. 

We  begin  our  discussion  of  the  influence  of  interactions  by  con¬ 
sidering  the  most  elementary  diagram  fragment  with  an  interaction, 
a  free  particle  line  connected  with  a  single  interaction  line  (Fig.  9). 


/*3’r3 

(2)  , 

PZ 

q 

13) 

<l> , 

,P| 

^|-r! 

Figure  9 
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When  such  a  fragment  exists  as  part  of  a  more  complicated  cluster 
integral,  the  contribution  to  log  Za  involves  the  integration  over  all 
points  at  which  interactions  can  occur.  Hence  one  would  have  to 
integrate  in  particular  over  all  values  of  r2.  At  fixed  ra  the  propagator 
of  our  fragment  is  the  product  of  propagators  (1),  (2),  and  (3): 

(5°a)  (I)  ^p  J  exP  M'a  -  »•,)  />,*  1  -  (&  -  ft  V>?/2m]  d3px, 

(50b)  (3)  -  J  u(q)  exp  [iq  {rA  -  ra)h']d3q, 

(50c)  (2)  J  exP  M's  “  ra)-pjr 1  -  (ft,  -  Pt)pH2m}  d3pa. 

The  minus  sign  appears  in  (3)  because  in  the  basic  integral  equation 
(25)  the  interaction  H,( 3)  appears  with  a  minus  sign  so  that  in  all 
iterations  which  occur  in  the  perturbation  theory  the  minus  sign 
always  goes  with  H,  and  therefore  with  idq).  If  we  multiply  these 
three  together  and  integrate  over  all  r2,  this  integration  yields  a  factor 

(51)  J  exp  [ir2  (pi  ~  q  -  p2)h  x]  d3r2  =  (Irrhf  8 (px  -  q  -  p2). 

The  variable  px  represents  the  momentum  of  our  free  particle  while  it 
goes  from  r,  to  r2,  p2  that  is  in  the  transition  from  r2  to  r3  while  q  is 
the  momentum  transferred  from  r2  to  rt  due  to  the  interaction  of  our 
particle  of  interest  at  r2  with  another  at  rt.  Eq.  (51)  is  just  a  state¬ 
ment  of  the  conservation  of  momentum  at  (/-2ft,): 


(52) 


Pt  =  Pi  +  q. 


One  must  also  integrate  over  all  possible  values  of  /ft  at  which  an 
interaction  might  take  place.  This  is  most  conveniently  accomplished 
through  the  introduction  of  an  energy  variable  5  which  is  conjugate 
to  /S  in  a  manner  analogous  to  that  in  which  p  and  r  are  conjugate. 
We  note  that 


(53) 


2  f  g*'*  ds  re—p*iam  if  a  >  0, 
2n  is  +  p2l2m  \0  if  a  <  0. 


This  representation  is  especially  useful  because  it  allows  one  to  inte¬ 
grate  freely  over  the  various  variables.  For  example,  suppose 
interactions  were  to  occur  ai  /9,  and  ft,  in  Fig.  9  as  well  as  at  ft,,  so 
that  if  the  inequality  /ft  <  ft,  <  ft,  were  violated,  the  complete  graph 


•- 
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of  which  Fig.  9  is  a  fragment  would  change  its  shape.  Through  use 
of  a  representation  similar  to  (S3)  for  the  Gaussian  in  (1)  and  (2) 
one  would  be  able  to  integrate  freely  on  pa  from  -  oo  to  +  oo  because 
the  integral  would  vanish  automatically  when  p2  extended  beyond 
its  allowed  range  (pu  jS3). 

We  are  now  in  a  position  to  derive  general  rules  for  the  construction 
of  the  integral  associated  with  an  arbitrary  diagram  such  as  that 
given  in  Fig.  10.  We  do  this  by  first  finding  a  simple  expression  for 


Figure  10 

a  single  toron  fragment  of  such  a  diagram.  Then  we  find  rules  for 
combining  torons.  Fig.  1 1  represents  a  typical  N  toron  with  n 
interaction  lines  at 

(54)  0  <  h  <  pa  <■■■  <pn  <  Np. 

The  momentum  transfer  at  these  points  is 


•  -  .9 *• 

Our  N- toron  has  been  mapped  into  a  circle  of  circumference  N) 3,  the 
first  turn  when  constructed  on  our  (r,  p )  cylinder  is  included  in  the 
range  (0,  P),  the  second  turn  in  the  range  (0,  2/9),  etc.  We  denote  the 
full  propagator  associated  with  the  fragment  in  Fig.  1 1  by 

(55)  F N(r tra  ■  •  • ;  P1P2  •  •  • ). 

The  translational  and  periodic  symmetry  of  a  toron  is  such  that  a 
translation  of  all  interaction  points  by  an  amount  j3  leads  to  exactly 
the  original  diagram  again.  This  would  weight  the  same  diagram 
twice.  In  order  to  avoid  such  a  multiple  weighting,  we  restrict  the 
interaction  Pi  to  the  interval  (0,  P) 

0  <  Pi  <  p. 


(56) 
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Figure  1 1 

Now  our  propagator  (55)  is  the  product  of 

(a)  all  free  particle  propagators  which  carry  particles  from  px  to 
Pt,  Pi  to  p3,  etc.  ( r0  and  p0  being  identified  with  rn  and  pn) 

(57)  I  I  j(2 nh)~3  j  exp  [i(r,  -  r,.x)  p,h -  (P,  -  P,-i)pfl2m]  d3p^j; 

(b)  all  interaction  propagators 

(58)  I  I  f  [-u(q,)\  exp  [iq,  (r)  -  rjfr1]  d3q,; 
t- 1  J 

(c)  ( -  I )v  * 1  for  fermions; 

(d)  zN,  a  weight  r  being  associated  with  each  particle  when  one 
calculates  £  b,:‘. 

One  must  finally  integrate  over  all  points  (r„  p,)  at  which  interactions 
can  occur  on  our  AMorons.  We  have  denoted  the  momentum  of  a 
free  particle  between  the  y'th  and  (j  +  l)st  interactions  by  ph  It  is 
convenient  to  define  the  momentum  p  by 

(59)  p  s  Pn. 

It  is  exactly  the  momentum  of  the  free  particle  which  exists  at  p  =  0. 
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We  begin  our  calculation  of  FN(ru  ■•■,/£)  by  examining  all  the 
terms  in  the  product  of  (a),  (b),  (c),  and  (d)  which  contain  only  ru 
r2%:  ••,/■„.  Integrating  over  r,  and  applying  the  results  (51)  and  (52) 
we  find  that  the  (2*6)" 3  factors  in  (57)  cancel  out  and  the  momentum 
conservation  laws 

(60a)  p, . ,  =  p,  +  q„  j  =  2,  3.  •  •  ■ ,  n, 

(60b)  pn  =  px  +  qu 

also  appear  through  delta  functions  similar  to  that  in  (51).  Each  p, 
can  be  expressed  in  terms  of  p  (see  (59))  and  the  q's  through 

(61)  p,  =  p  +  </,♦!  +  q,t2  +  •••  +  <jf„  j  <  n. 

If  we  add  all  terms  in  (60a)  and  (60b)  we  find 

(62)  qx  +  q2  +  ■  ■  ■  +  qn  =  0. 

This  is  just  a  statement  that  the  total  momentum  leaving  our  N-toron 
through  interactions  is  zero  as  one  would  have  expected. 

We  summarize  the  above  results  by  collecting  all  the  factors  which 
now  contribute  to  (55)  with  the  exception  of  the  j3,  dependent  factors 
of  (57).  The  only  remaining  independent  free  particle  momentum 
variable  is  p\  the  rest  disappearing  through  the  integration  over  the 
delta  functions  which  led  to  (60).  The  condition  (62)  is  expressed 
by  introducing  a  delta  function  factor  8(</i  +  -  +  qn)  into  our 
expression  for  Fs.  Our  required  expression  is 

(63)  -(-:)N  j  ■  ■  ■  j  d3p  d'qx  ■  d3qn  &(qx  +  q2  +  -  +  qn) 

n 

x  I  "J  [  -  u(q,)  exp  (iq,  r',h~l )]  (/S  dependent  factor). 

i  - 1 


We  finish  our  calculation  of  Fs  by  examining  the  dependent  factor 
of  (63).  It  is  the  /3  integral  under  the  restrictions  (54)  and  (56).  The 
/3  dependent  factors  of  (63)  are  just  those  of  (57),  namely  the  Gaussian 
factors.  In  order  to  simplify  the  P  integrations  we  use  the  representa¬ 
tion  (53),  for  then  we  can  integrate  freely  over  the  /3’s  with  the  restric¬ 
tion  (54)  removed  with  the  assurance  that  when  it  is  violated  the 
integrand  will  vanish  automatically.  Our  required  factor  before 
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integration  is  then  (the  S  function  being  defined  by  (53)) 

(64)  S(P g  —  Px «  Pi )S(@$  —  fi3 ;  pt) ■  ■  •  S(pn  —  fin-i*  Pn-\) 

X  S(Nt 3  -  (pn  -  p x):pn)  =  J  WP  +  ft  -  P n  +  1)^n  +  l 

*  ~pi’Pi) 

/-I 

where  ft  *  i  is  defined  to  be 

(65)  ft  4 1  =  Np  +  fa. 

We  have  postulated  the  interaction  with  other  torons  to  occur  at  p 
values  ft,  ft.  •••.ft.  Since  all  interactions  are  isothermal  Pi  must 
correspond  to  the  same  temperature  as  P‘,.  All  levels  ft,  ft  +  P , 
ft  +  2/3,  •  •  • ,  etc.,  are  at  the  same  temperature  on  our  (r,  P)  cylinder 
and  our  isothermal  condition  can  be  expressed  as 

(66)  ft  =  p,  mod P ,  j  =  1,2 ,  •  • 

The  delta  function  of  period  P 

(67)  ft(ft  -  ft)  =  g  2 

vanishes  unless  P‘,  =  p,  mod  ft  On  the  other  hand 

(68)  J\(ft  -  p,)dpt  =  I. 

If  a  factor  ft(ft  -  p,)  is  introduced  into  (64)  for  each  j,  then  the 
resulting  quantity  will  vanish  unless  each  p,  has  a  value  compatible 
with  the  preassigned  ft's.  Our  expression  of  interest  is  then 

(69)  f  8( Np  +  px  -  ft  ^ , )  </ft  + ,  f]  ft(ft  ~  ft)$(ft , .  -  Pi ;  />,)• 

»  /  =  i 

Furthermore,  in  the  calculation  of  a  cluster  integral  one  must  integrate 
over  all  possible  points  at  which  interactions  can  occur;  i.e.  over  all 
Pi  i<Pi<Pi*  i-  Since  our  S  function  was  defined  to  vanish  when 
ft  ,i  <  P,  for  all  y,  we  can  integrate  freely  on  (69)  except  for  Px,  which 
(see  56)  was  limited  to  the  range  (0,  P) 

(70)  f  dpx  f  •  •  •  f  KNP  +  px-  Pn  4 1 )  dp  ft  4  j  dpn ■  ■  dpx 

JO  J  -  rr,  J 

I~I  ■“  Pj)S(Pj +i  —  Pi ;  Pf )• 


X 
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Following  (53)  we  set  (J  =  1,  2  -  •  ■«) 


(71) 


After  substituting  (71),  (67)  and  the  Fourier  integral  representation  of 
the  first  delta  function  of  (69), 


(72)  8^0  +  0,  —  ft,*,)  =  i  r.e™'*'' 

into  (69),  it  becomes 


<7,)  s*JL 

x  1,1  {[&  +  ,5j]  cxp  2ni,,pilp}  J0°  exP  {'&(*-.♦  i  -  *1  - 

*  n  {s  J. .  #>  “p  f«j'  t7)} 

where  tntl  =0.  The  integral  with  respect  to  0,  is  exactly 
(74) 


Hence  we  have  the  energy  conservation  law 


(75a)  */-i  =  s,  +  2irtjp,  j  =  2,  •  •  ,  n, 

(75b)  sn,i  =  sn. 


at  each  interaction  point.  This  is  analogous  to  the  momentum  con¬ 
servation  (60)  except  that  the  quanta  of  energy  transferred  are  multiples 
of  (2 w/0)  since  /,  is  always  an  integer.  We  define  s  by 

(76)  .r  s  .*„  =  iB  +  1. 

If  we  sum  over  j  in  (75)  we  find 

s 

2^ 

(77)  -^r  (f8  +•••+/„)  =  s,  -  s„  =5,  -  sn  „ , 
so  that 


(78) 


Sn  +  1  —  sl 


2nl1  __  2ir  (L, 

T  ~  J  t '' 


Hence  the  0i  integral  is  equivalent  to 
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The  s  integrations  are  carried  out  successively  with  the  aid  of  (69)  and 
(70)  starting  with  sn _ ,  and  ending  with  st.  Expression  (73)  reduces  to 


m  sfv  I--.8 . 

where 

2tt 

(81a)  st  =  s  +  =jj-(l,  +  i  +  “■  +  '»).  j  <  n, 

(81b)  sn  s  s,  p  s  pn , 


and  the  other  p' s  are  related  to  p  and  the  q's  by  (61).  Eq.  (80)  is  just 
the  0  dependent  factor  which  must  be  inserted  into  (63). 

At  this  stage  it  should  be  noted  that  one  can  very  well  collect  a 
complete  class  of  fragments  of  diagrams  together;  namely,  all  those 
which  are  torons  similar  to  Fig.  1 1  with  n  interaction  lines.  If  we 
insert  (80)  into  (63)  and  sum  over  all  N  from  1  to  oo,  we  find  a  single 
expression 

(82)  F(r\,  ra- •  • ;  ft,  ft  •  •  -ft)  =  f  F»(ru  ■  •  • ,  ft) 

1 


=  2  f  d3qr  ■  d3qnpl-n  &(qx  +  •••  +  qn)  +.+ 

(j  -  .-Ij  «  -  <fc  «/  w 


U.o 


X  •  -,</n'n)  fl  eXP  ^  *  +  Z^ft/ft) 

>  ”  1 


where 


(83) 
and 

(84) 


•  •  qjn)  =  •  •  •  J  d3p  ds  Mis)  n  jfry  + 

|'rew,/(l  +  Fermi  Dirac, 

w(/.r)  =  ize,'”/(l  —  :e,e‘)  Einstein  Bose, 

[zeW5  Maxwell  Boltzmann. 


The  Fermi  Dirac  form  of  Mis)  follows  immediately  from  (63)  and 
(82).  The  Einstein  Bose  case  follows  after  the  minus  signs  are 
removed  from  the  term  in  front  of  the  integral  sign  in  (63).  The 
Maxwell  Boltzmann  result  corresponds  to  the  case  N  =  1  in  (63), 
since  in  this  classical  limit  only  torons  of  order  one  occur. 
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Let  us  now  direct  ourselves  to  the  problem  of  combining  fragments 
such  as  Fig.  11  to  yield  complete  cluster  integrals.  The  free  particle 
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Figure  12 


(b) 


propagator  which  has  three  fixed  positions  (Fig.  12a)  differs  from  the 
propagator  of  a  fragment  with  three  fixed  positions,  one  at  the  end 
of  an  interaction  line  (Fig.  12b)  by  having  the  interaction  propagator 

U(p2  -  Pi) 

replaced  by 

(2nh)~3. 

This  follows  from  the  fact  that  the  integrations  over  r  in  (12b)  yields 
a  factor  (2 wA)3  (see  Eq.  51)  which  would  not  appear  in  (12a).  The 
interaction  line  of  (1 2b)  of  course  does  not  appear  in  (12a). 

Now  suppose  that  in  a  general  fragment  such  as  that  in  Fig.  1 1,  an 
interaction  line  is  omitted  and  an  unintegrated  point  is  left  on  the 
toron.  Then  the  appropriate  //(</)  in  (82)  would  be  replaced  by 
(2iri hy\ 

To  get  a  feel  for  what  is  involved  in  combining  fragments  to  produce 
complete  cluster  integrals,  we  start  by  combining  the  two  fragments 
(a)  and  (b)  of  Fig.  1 3  by  integrating  over  the  connecting  point  (r303). 
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Figure  13  Figure  14 

The  propagators  of  fragments  (a)  and  (b)  are  respectively 

/9~‘Z  f  [d3qid3q3Kq\  +  ^Matoi'i^a'a)!-^)]^)'3 

Ilia  j  j 

exp  /[(?,  r,  +  qar2)h~l  +  2ir(f,/3i  +  tapa)IP] 

=  /3'1  2  f d3q2  A2(—q2,  -t2\q2t2)[-i^qa))(lnhy3 

<a  » 

x  exp/[^2  (r2  -  rjA-1  +  2»r/2032  -  0,)//3] 

and 

£■‘2  -  ta ;  “(92)1(2^) ' 3 

'a  J 

X  exp  /[</2  •  (r3  -  r2)frl  +  2rrr2(03  -  /J3)//J). 

If  we  multiply  these  two  propagators  together  and  integrate  over  r2 
and  fi2,  we  obtain 

(85)  j3(2^)3  2  A2(~qa>  q3-  fa>l-«(?a)]* 

x  exp  i[qa  (ra  -  ry)/i  l  +  2jt/2(/33  -  /3,)/|8]. 

One  can  proceed  in  the  same  manner  to  show  that  a  chain  of  n 
torons  (Fig.  15a)  has  a  propagator 

1 


(86)  W,P;r\P)  = 


«(?))" 


2  -r.q.tM-, 

X  exp  i[q  (r'  -  r’fl-1  +  2rr/(jS’  -  /3)/j8]. 


r,**0 — O — O - 
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The  propagator  of  a  ring  of  n  particles  is  obtained  by  letting  r  =  r\ 
p"  =  P'  and  integrating  over  all  r  vectors  which  are  in  our  container 
of  volume  V  as  well  as  over  alt  fS  in  the  range  (0,  p).  We  then  obtain 

(87)  ^j"3  2  J ~Aa (-4’  ~t:th  0u(q)}\ 

A  combinatorial  factor  is  also  associated  with  the  ring  diagram. 
This  is  1/n!  multiplied  by  the  number  of  ways  a  ring  of  n  objects 
can  be  formed  from  n  objects,  ( n  -  1)!,  the  number  already  discussed 
in  connection  with  a  free  «-toron.  The  required  combinatorial  factor 
is  actually  (1/2K n  -  !)!/«!  =  l/2n.  The  factor  1/2  arises  because  in 
the  calculation  of  the  number  of  possible  rings  every  ordering  of 
particles  occurs  as  well  as  the  corresponding  reverse  ordering  while 
in  the  integration  over  all  possible  momentum  transfers  q  around  the 
ring  both  signs  of  q ,  +  and  -  occur.  The  -q  momentum  transfer 
in  a  given  ring  ordering  is  equivalent  to  the  +q  momentum  transfer 
in  the  reverse  ordering.  Hence  the  factor  1/2  must  be  inserted  to 
compensate  for  this  overcounting  of  a  configuration  by  a  factor  of  2. 
The  sum  over  all  rings  is  (counting  0»0  as  a  ring  of  two  torons): 

(88)  Sum  over  all  rings  =  2  J  ^ 

^  [-uiq)A2(-qu  —t:q,  t)]*ln 

n  2 

=  2(2 ^,2.  Ml) 

-  log  [I  4-  u(q)Aa(-q ,  -t:q,l)]}. 

The  simplest  diagrams  with  one  toron  acting  as  a  node  are  given  in 
Fig.  16.  Both  of  these  are  composed  of  two  chains  and  one  toron 
with  four  interaction  points.  In  Fig.  16a  our  required  propagator  is 
then  the  integral  over  (r,,  /),),  ■  •  (r4,  jS4)  of  the  product  of  two  chain 

propagators  (see  Eq.  86  for  full  expansion) 

■^KM2p2\ r ,/?,)  and  .^„3(r4/S4 ;  r3p3) 

and  one  toron  propagator  with  four  interaction  points: 

(iJfij5  2  J  -- J^V ■  d3q*Pl'a %qi  +  <7*  +  q*  +  ?4)s(I+...  +t4.o 

X  /l4(</,f,-  •  q<U)u(qi )uiqa) 

X  exp/^ft-r,  +  •  •  •  +  qA  rt)frl  +  j  (/,£,  +  •  •  •  + 
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(a) 


Figure  16 

After  multiplying  all  these  functions  together  the  integrations  over  the 
r' s  and  /3’s  yield  delta  functions  which  give  the  usual  conservation  of 
momentum  and  energy  equations.  One  finally  obtains 

*89)  !(2^j5  2 // 

x  A2*(  —  ?3~ t3'.q2t3)AA(  —  qi  — :qxi j :  ~q2  —  t2‘.q2t^). 

The  cases  in  which  nx  or  n2  are  equal  to  zero  are  “exchange  integrals” 
and  require  a  special  analysis  which  we  shall  not  make  here. 

The  reader  can  now  see  how  to  proceed  with  the  calculation  of  a 
general  diagram.  He  will  have  little  trouble  in  convincing  himself 
that  he  should  use  the  following  rules: 

(1)  For  each  toron  with  m  interaction  points  in  the  diagram, 
introduce  a  factor  /lm(<M, ;  <72  qmtm )  where  (<?„  r,),  •  •  •,  ( qm ,  tn ) 
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are  the  amounts  of  momentum  and  energy  lost  at  each  interaction 
point. 

(2)  For  each  interaction  line  of  momentum,  q,  introduce  the  factor 
-u{q). 

(3)  Introduce  a  factor  /J'-'  where  /  is  the  total  number  of  torons 
and  /  the  number  of  interaction  lines  in  the  diagram. 

(4)  Integrate  the  product  of  the  above  factors  over  all  momentum 
transfers  and  sum  over  all  energy  transfers  (energy  being  transferred 
in  quanta  of  magnitude  2w///3,  t  being  an  integer).  The  conservation 
laws  are  to  be  applied  at  each  interaction  point  on  a  toron. 

(5)  Introduce  a  factor  F/(2wA)3. 

(6)  Introduce  the  combinatorial  factor  0//12")  (number  of  ways 
diagram  can  be  drawn)  where  /  =  number  of  torons  and  n  is  the 
number  of  independent  paths  along  which  momentum  can  be  trans¬ 
ferred. 

The  factor  F(2tr6)~3  arises  from  the  fact  that  the  last  delta  function 
in  dealing  with  momentum  conservation  in  particle  propagators  is  to 
be  interpreted  (see  Eq.  23)  as  K8PlPa(2 nhy3.  The  physical  signifi¬ 
cance  of  the  V  is  that  all  interactions  depend  only  on  relative  coor¬ 
dinates.  Hence  in  the  configurational  form  of  the  partition  function, 
the  integration  over  the  position  of  the  last  particle  merely  contributes 
the  volume  of  the  container. 

The  origin  of  the  factor  j3' " '  is  as  follows.  In  the  (r,  p)  representa¬ 
tion  of  the  grand  partition  function,  the  ith  toron  propagator  of  a 
complex  diagram  composed  of  /  torons  has  the  P  containing  factor 
ni-i  (PIP"')  -  P1  21 ,  since  the  total  number  of  interaction  points, 
I is  twice  the  number  of  interaction  lines  /.  The  interaction  lines 
represent  isothermal  interactions,  both  ends  of  the  line  being  associated 
with  the  same  temperature.  Hence  in  the  final  p  integrations,  a 
factor  P  appears  for  each  line,  yielding  a  total  factor  p1.  When 
multiplied  by  the  above  p' ~2',  we  obtain  the  /S'"'  mentioned  in  rule  3. 

In  order  to  perform  detailed  calculations,  one  must  have  explicit 
expressions  for  the  /l*s  rather  than  their  integral  representations. 
Although  the  4-fold  integrations  (83)  are  relatively  straight-forward 
(at  low  temperatures  they  proceed  in  a  manner  similar  to  the  usual 
Summerfeld  method  for  integrals  which  appear  in  the  perfect  FD  or 
EB  gas)  the  required  expressions  are  quite  long  except  for  the  case 
n  =  2.  ,  A  detailed  investigation  of  these  integrals  is  contained  in  a 
forthcoming  review  of  the  statistical  mechanics  of  an  electron  gas. 
Here  we  merely  record  a  few  asymptotic  results  for  A2(-q,  -r,q,  t) 
(see  reference  15). 
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In  the  Maxwell  Boltzmann  statistics 


(90a) 


(90b) 


^a(-<7.  0 ;</,  0) 


v  (~  I)"n!(g3/3/2m)" 

P\  P  I  „40  (2 n  +  D! 


-  +  0(,>)]  as?-»0 

+ aS,^oo. 


If  t  #  0  and 

(91)  /M-*  I)  -  q2  lm(llq)i  (^)32  {1  +  0{\!q*[\  +  (L-/92)2])} 


(92)  r  =  2  nmtjp, 

when  either?2  >  2 mlP  or '(i*7^)a  >  2mlP. 

In  the  FD  case  as  /3-»x  for  fixed  12  and  F 


+  (F  +  /£>)*]  log 


KF  +  /g)  •+  /] 
I (V  +  iQ )  -  »] 


(93) 


+  H 


+  (V  -  iQ)2]  log 


\(Y  -  /(?)  -  01 
I (y-  iQ)  +  i)J 


where 


(94)  z  =  exp  -  plPI2m\  V  =  ;  and  (?  =  ql2p0. 

It  is  easily  shown  from  (93)  that 

f 4nmp0( I  —  Ftan  "1  K"1)  as  £)--»•  0, 

(95)  ,)~|4^2_v^ 

4.  Application  to  an  assembly  of  charged  particles.  We  now  apply 
the  general  formalism  described  above  to  the  statistical  mechanics  of 
a  gas  of  negative  ions  imbedded  in  a  continuum  of  positive  charge 
whose  charge  density  is  so  fixed  that  the  complete  assembly  is  electri¬ 
cally  neutral.  Since  our  formalism  is  a  systematic  perturbation 
theory  in  which  log  Za  is  presented  as  an  expansion  in  powers  of  the 
inter-particle  coupling  constant  (e2  in  our  case)  it  is  best  in  the  weak 
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coupling  range,  i.e.  e2  ->  0.  The  other  parameters  which  are  necessary 
to  describe  our  assembly  are  the  particle  mass,  m;  the  particle  density, 
/>;£=  llkT and  ft. 

Since  the  constant  h  is  absent  from  the  grand  partition  function 
appropriate  for  the  high  temperature  classical  limit,  this  function  must 
depend  only  on  the  single  dimensionless  parameter  which  can  be 
constructed  from  the  remaining  quantities,  pe2p113.  Hence  a  pertur¬ 
bation  expression  which  is  valid  for  small  e2  is  equally  valid  for  small 
/3  (high  temperature)  or  small  p  (density)  for  fixed  e 2. 

In  the  ground  state  (i.e.  /3  ->  oo,  zero  temperature)  the  parameter  /3 
disappears  from  various  formulae.  The  only  dimensionless  quantity 
which  can  be  constructed  from  the  remaining  parameters  is  me2lft2pl,a. 
Hence  in  the  ground  state  a  perturbation  expansion  in  e 2  is  appropriate 
for  high  densities  at  any  fixed  e. 


OO  © 

(a)  (b) 

Figure  17 


The  above  remarks  imply  that  if  we  derive  a  perturbation  correction 
to  a  perfect  gas  law  which  is  a  small  coupling  constant  approximation, 
it  is  equally  valid  as  a  high  temperature,  low  density  approximation 
or  a  low  temperature,  high  density  approximation.  . 

To  obtain  the  first  correction  to  the  perfect  gas  grand  partition 
function  we  calculate  the  terms  of  lowest  order  in  e2.  The  diagrams 
associated  with  terms  of  order  e 2  have  one  interaction  line.  They  are 
exhibited  in  Fig.  17.  It  turns  out  that  in  an  electrically  neutral 
assembly,  all  diagrams  which  can  be  decomposed  into  two  disconnected 
diagrams  by  cutting  a  single  interaction  line  (for  example.  Fig.  17a 
and  18(a)  and  (e))  vanish.  The  diagram  of  Fig.  17b  represents  what 
is  known  as  the  lowest  order  exchange  integral.  It  was  calculated 
many  years  ago  and  offers  no  difficulty.  We  do  not  discuss  it  here 
sjnce  4io  special  features  of  the  systematic  perturbation  theory  are 
required. 

The  next  order  terms  are  those  proportional  to  e4,  those  associated 
with  diagrams  with  two  interaction  lines.  They  are  exhibited  in 
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Fig.  18.  For  reasons  mentioned  above.  Fig.  18a  and  Fig.  18b  con¬ 
tribute  nothing  to  log  Za. 

0-0-0  0-0 

(a)  (b) 


0=0  ©  © 

(c)  (d)  (e) 

Figure  1 8 

The  contribution  of  18(c)  to  logZc  is  (employing  the  rules  of  the 
last  section) 

(%)  — £j3  £  jd3qu2(q)Al(-q, -t;q,t), 

where  (see  Eq.  45) 

u(q)  =  e2l2hir2q2. 

Note  that  when  i  =  0  in  the  MB  statistics  (Eq.  95) 

(97)  A\  -  r2/?2(2wm//3)3  2  +  0(q2) 

so  that  our  required  integral  (96)  diverges  in  the  small  q  range.  No 
difficulty  exists  in  the  large  q  range.  When  t  ¥=  0,  A\  =  0(q*). 
Hence  for  small  q  when  t  #  0,  d3q  u2{q)A\  =  0(q2)  dq  and  there  is 
no  divergence  problem.  In  the  FD  case  (Eq.  95)  A2  is  independent 
of  Q  as  Q  -  >■  0.  Hence  the  integral  in  (96)  again  diverges.  In 
summary,  after  working  so  hard  to  derive  a  systematic  perturbation 
theory,  we  find  that  the  first  perturbation  integrals  diverge  so  that  at 
first  glance  our  work  seems  for  nought.  Actually  the  divergence 
difficulty  was  known  (in  a  somewhat  different  form)  in  the  classical 
theory  of  electrolytes  some  forty  years  ago  as  well  as  in  the  FD  case 
of  an  electron  gas  in  a  metal  some  thirty  years  ago. 

Debye  and  Hiickel  avoided  the  divergence  by  employing  another 
method  of  the  calculation  of  the  thermodynamic  properties  of  electro¬ 
lytes.  However,  their  method  was  appropriate  only  for  very  dilute 


APPLICATION  TO  AN  ASSEMBLY  OF  CHARGED  PARTICLES  173 


solutions  and  could  not  be  extended  to  higher  concentrations.  J.  E. 
Mayer  made  the  correct  step  to  resolve  the  divergence  difficulty  in  the 
MB  case  and  Gell-Mann  and  Brueckner  independently  used  the  same 
idea  in  the  FD  case. 

Suppose  that  instead  of  fretting  over  our  troubles  in  the  calculation 
of  (96;,  we  go  on  to  the  next  order  nonexchange  contribution  to 
log  Z0,  the  ring  of  three  interacting  particles 


6(2  nh)3 


2  J d3q  u3(q)A\(-q<  -t:q,t). 


This  term -is  even  more  divergent  than  (96).  However,  the  sum  oyer 
all  rings  (Eq.  88) 


(98)  2(2nh)3  ^  -t:q,  l)u 


t)u(q) 

-  log  [1  +  u(q)Aa(-q,  - t:q ,  f)]} 


is  convergent.  For  example,  if  in  the  MB  case  we  examine  the  /  =  0 
term  and  approximate  Aa  by  :P(2nmiP)32,  we  have 


(99) 
where 

(100) 


108  (‘ +?)} 


zp(2nmlp)3l2e2 
2  htr2 


The  above  convergent  integral  has  the  value 

2-n2Va3'2  2rr2 V  (ze2ni\312  (2m\3li 
(  '  3( 2tt/j )■*  -  3(2tt/i)3  \  h  7  U/3/ 


Since  the  perfect  gas  contribution  (1-toron  contribution)  to  the 
log  Z0  is  (see  Eq.  47)  zV!h3(2vPjm)3'2,  the  total  contribution  of  the 
noninteracting  particle  graphs  plus  ring  graphs  to  logZc,  or  to 
PVjkT  is  (sccEqs.  48  and  36) 


(102) 


PV  zV  2n2V  l:e2m\32  /2m)34 

kT  -  log  A  -  p^nplmf  2  +  3(2nft)3  \  h  )  \np) 


while  z  and  the  density  p  are  related  through  (Eq.  49) 


(103)  P 


z  Hog  Zu  z  n2  /ze2m\3l2/2m\3li 

V~F:  h3(2npim)312  +  (2nh)3\~r)  Up) 
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Here  one  is  interested  in  z  as  ea  ->  0.  In  the  limit 

(104)  z0  =  pha(2vplm)aa. 

The  correction  82  to  this  value  when  e  is  small  but  finite  is  obtained 
by  substituting  (104)  into  the  second  term  on  the  right  hand  side  of 

(103)  and  solving  for  the  z  in  the  first  term  on  the  right  hand  side 

(104) .  One  obtains 


(105)  z  =  (2n)aiiha(plm)aiaP{\  -  e3/33  V'V/a  +  0(p)}. 


Substituting  this  into  (102)  we  find  the  Debye-Hiickel  equation  of 
state 

(106)  PlkT  =  Pjl  ~  I*1  V/33y/2j- 

The  high  pressure  ground  state  energy  of  an  electron  gas  is  obtained 
by  substituting  (93)  into  (98).  As  p  ->  oc  (T  0)  the  parameter 

V  =  2irtmlf}qp0 


becomes  a  continuous  variable  even  though  t  is  discrete.  .  The  sum¬ 
mation  with  respect  to  t  can  be  converted  to  an  integral  with  respect 
to  V.  Then  (98)  becomes  (after  letting  q  -  q  p0  and  ignoring  the 
prime  on  q  and  allowing  for  two  spin  states  by  replacing  u(q)A2  by 

2u(q)A3) 


(107) 


VplP 


m(2m 


e3A2(q,  V) 
hir3plq2 


ft^pW  J/ 


The  log  Zc  is  then,  to  the  ring  integral  approximation, 


(108)  logZc  =  log  Z{P’  +  (107) 

where  Z„)  is  the  grand  partition  function  of  a  perfect  Fermi  gas. 

The  internal  energy  is  found  by  differentiating  (108)  with  respect  to 
P  at  constant  2  =  exp  ppH2m\  the  density  by  differentiating  (108) 
with  respect  to  z  at  constant  p.  For  this  purpose  we  substitute 
p0  =  (2mP~'  log  2)l,a  into  (108)  before  carrying  out  our  differentia¬ 
tions.  The  details  of  the  differentiation  are  tedious  but  straight¬ 
forward.  They  have  been  given  in  some  detail  in  the  original  paper 
of  Montroll  and  Ward.  As  in  the  classical  case  we  assume  that 
2  =  z„  +  82  where  z0  is  now  chosen  to  be  exp  (Ppj/2m)  ( pF  being  the 
Fermi  momentum  H3nap)va.  Then  the  correction  Sz  is  found  to 
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keep  the  density  at  p.  Now  one  can  express  the  ground  state  energy 
(as  /}-  *•  qo  )  as 

£  =  £o(r)  +  S  E(z) 


-  £o(20)  +  Sz  +  S£(2o)}  + 

The  first  order  correction  to  the  free  particle  contribution  to  the 
ground  state  energy  has  two  parts,  one  being  the  direct  correlation 
energy  evaluated  at  z  -  z0  and  the  second  being  the  change  in  the 
free  particle  energy  due  to  the  fact  that  z  must  be  changed  to  yield  the 
proper  density.  One  finally  obtains  the  correlation  energy, 
[E  -  E0(:0)],  to  be 


009)  £corr 


-W 

8nma3r$ 


e2A2(q,  V) 

hn2pjq3 


eA3(q,  HI) 
h*2pjq2  Jj 


where  (4l3)nPrl  =  1  and  «  =  (4/9*)'  \  This  is  exactly  the  Gell- 
Mann-Brueckner  formula  for  the  correlation  energy  per  particle  of 
an  electron  gas.  A  detailed  discussion  of  the  integral  is  given  in  the 
Brueckner  lecture  notes  of  the  1958  Ecole  d’Ete  de  Physique  Theorique. 
Numerically  Brueckner  finds  (in  Rydbergs) 


£,  orr  =  -  0.096  +  0.0622  In  r, 

with  r,  =  r0!rH,  r„  being  the  Bohr  radius. 

The  physical  idea  behind  the  removal  of  the  small  q  divergence  is 
that  each  direct  interaction  between  two  particles  is  to  be  replaced 

by  the  sum  of  - + - O - +  — — O— O - +  •  •  • ,  the  direct 

interaction  plus  an  interaction  through  an  intermediate  particle  plus 
one  through  two  intermediates,  etc.  We  examine  the  consequence  of 
this  idea  in  the  classical  limit.  We  consider  only  the  terms  /  =  0, 
since  in  our  required  limit  only  these  terms  led  to  divergences.  Then 
we  should  replace  u(q)  by 

(110)  u(q)  -  A2(~q,  0;  q,  0)u2(q)  +  Alu3(q)-  -  u(q)l[  1  +  A2u(q)]. 

From  this  we  can  calculate  an  effective  potential  to  replace  the  coulomb 
potential  e2/r.  This  potential  would  be  just  the  Fourier  transform 
of  (110) 

e3  Cd3qcxp(iqrtrl)  e2 

2 tm2q2  J  [1  +  aq'3]q2  ~  7  CXP  "  " 


(Ilia) 
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where  1  Ik  is  the  so-called  Debye  length : 

(Nib)  *a  -  4irppe*. 

The  factor  exp  —*r  is  the  well-known  Debye  screening  factor. 

Now  it  can  be  shown  in  a  systematic  way,  by  rearranging  diagrams 
in  the  cluster  integral  expansion,  that  the  grand  partition  function 
can  be  expended  in  terms  of  cluster  integrals  involving  the  screened 
coulomb  potential.  Let  us  see  what  is  involved  in  calculating  in  the 
classical  limit  the  simplest  of  these  (see  Fig.  19).  In  the  figure  each 


CeO 

Figure  19 

bond  connecting  our  two  l-torons  is  the  screened  potential  rather 
than  e2/r.  We  perform  the  calculation  in  position  space.  First  we 
note  that  our  two  torons  at  the  ends  of  the  diagram  shrink  to  a  point 
in  the  limit  /9  ->  0,  since  each  toron  has  a  circumference  0.  We  also 
divide  by  1  /3 !  so  that  we  can  integrate  freely  over  the  /3's  at  interaction 
points  (see  remark  after  Eq.  32).  Note  that  the  free  particle  propa¬ 
gator  (19)  approaches  8(r’2  -  /•’,)  as  -  &)  -- 0.  Hence  all  inter¬ 
action  points  in  Fig.  19  coalesce  into  one  point  and  our  required 
integral  is 

-j;J  4irra  exp  -  Krj  dr-  J  r " 1  exp  (  -  3*r)  dr 

which  diverges.  We  eliminated  the  divergence  in  one  class  of  dia¬ 
grams  due  to  the  long  range  character  of  the  coulomb  interaction 
(small  momentum  transfer)  only  to  find  that  another  class  of  diver¬ 
gences  have  appeared  due  to  the  behavior  of  the  screened  coulomb 
potential  at  short  distances.  Here  we  must  be  brave  and  consider  the 
whole  class  of  diagrams  with  two  node  points 


o  ♦  o  ♦  ^ 


+ . 
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which  are  known  as  “watermelons'*  in  the  classical  limit  and  “  ladders” 
in  the  quantum  limit.  The  nth  of  these  is 

so  that  the  sum  over  all  of  them  (starting  with  the  third,  the  second 
merely  being  our  old  ring  integral) 

-4 n  f  ra/exp(-e2r"1  exp  -xr)  -  l  -f  —  (exp  -xr) 

Jo  l  r 

-  ~  (exp  -2*r)J*/r 

which  converges.  This  type  of  collection  of  diagrams  was  first  due 
to  Morita,  its  application  to  the  coulomb  interaction  problem  by  Abe. 
The  collection  of  ring  diagrams  was  first  made  by  Montroll  and  Mayer 
and  the  application  to  the  coulomb  interaction  problem  by  Mayer. 
We  do  not  proceed  further  here  to  find  the  actual  contribution  of  the 
watermelon  terms  to  log  Zc. 

Summary.  We  conclude  with  a  list  of  the  main  ideas  involved  in  the 
perturbation  theory  of  quantum  statistics  of  interacting  particles. 

1.  The  theory  starts  with  the  Green's  function  form  of  the  Bloch 
equation  which  is  in  turn  transformed  into  an  integral  equation  which 
is  solved  by  iteration;  the  «th  order  iteration  is  proportional  to  the 
nth  power  of  the  coupling  constant  which  characterizes  the  strength 
of  the  interparticle  interaction. 

2.  A  diagrammatic  representation  is  introduced  which  makes  it 
possible  to  identify  any  term  in  the  perturbation  expansion  without 
long  detailed  formulae. 

3.  The  logarithm  of  the  grand  partition  function  (from  which  all 
thermodynamic  quantities  can  be  calculated)  expressed  as  a  sum 
over  all  connected  diagrams  containing  any  number  of  interaction 
lines  and  particles.  This  form  can  be  characterized  in  the  following 
manner.  The  grand  partition  function  is  the  sum  over  all  possible 
elementary  events  on  our  (r,  0)  or  (p,  E)  cylinder  which  involve  any 
number  of  particles  and  quanta  of  interaction  provided  that  the 
events  are  such  that  no  free  end  exists  on  any  particle  or  interaction 
quanta  propagator  lines  and  that  all  diagrams  which  represent  the 
events  are  connected.  A  weight  rn  is  given  to  events  which  involve 
n  particles,  every  segment  of  a  diagram  is  represented  by  either  a  free 
particle  propagator  or  an  interaction  propagator.  The  only  elemen¬ 
tary  events  are  the  creation  and  annihilation  of  particles  or  quanta 
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of  interaction  with  energy  and  momentum  being  conserved  during 
the  occurrence  of  each  event.  Actually  one  could  start  with  these 
remarks  as  the  basic  postulates  of  statistical  mechanics  and  derive  all 
other  results  from  them. 

4.  Finally,  divergences  occur  in  the  evaluation  of  various  cluster 
integrals  which  are  associated  with  our  diagrams.  These  are  removed 
by  summing  over  various  classes  of  diagrams  and  performing  energy 
momentum  transfer  integrations  after  the  summation  is  performed. 
This  is  the  weak  point  of  the  theory  as  here  presented.  One  has  to 
rely  on  physical  plausibility  rather  than  mathematical  rigor  to  have 
confidence  in  the  results.  It  is  not  even  clear  that  all  divergences  can 
be  dealt  with  in  this  way ;  it  just  turns  out  that  in  the  weak  coupling 
limit,  the  first  few  terms  in  the  systematic  expansion  of  thermo¬ 
dynamic  functions  seem  to  work  out  well.  One  optimistically  con¬ 
tinues,  hoping  that  all  is  right  with  the  world. 
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